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AB  ST R ACT 


Two  multivariate  classification  methods  have  been 

applied  to  discriminant  variables  measured  on  a  set  of  20 

Nevada  Test  Site  (NTS)  underground  nuclear  explosions  and  27 

North  American  Earthquakes.  The  classification  methods  used 

in  this  study  are  cluster  analysis,  and  linear  and  quadratic 

multiple  discriminant  analysis.  Discriminant  analysis 

requires  that  the  parameters  of  the  underlying  stochastic 

model  be  specified  a  priori,  which  is  usually  accomplished 

with  a  sample  training  set  of  events.  Cluster  analysis, 

however,  follows  a  non-parametric  approach  and  requires  no 

prior  information.  The  discriminant  variables  are  the  seven 

energy  ratios  calculated  for  these  events  by  Booker  and 

Mitronovas  (1964).  An  additional  discriminant  variable, 

Mg/m^  was  estimated  for  a  reduced  set  of  nine  earthquakes 

and  21  underground  explosions.  Four  M  / m,  estimates  were 

s  D 

obtained  for  each  event  in  this  subset  by  randomly 

generating  M  estimates  within  four  specified  variance 
s 

levels  about  the  Mg  versus  m^  lines  given  by  Alexander  and 

Lambert  (1973).  The  variances  used  in  generating  the  Mg 

estimates  are  multiples  of  the  variance  found  by  von  Seggern 

(  1  972  )  to  be  representative  of  M  measurements  for  NTS 

s 

explosions. 

Where  the  two  multiple  discriminant  analysis  methods 
were  used  to  classify  events  on  the  basis  of  the  seven 
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energy  ratios,  two  events  were  misclasslf led  in  both  cases* 

The  average  posterior  probability  of  correct  classification 

was  0.79  for  the  linear  analysis  and  0.95  for  the  quadratic 

analysis.  For  the  reduced  event  set  where  the  M  / m, 

8  0 

estimates  were  included,  no  events  were  misclassif ied  in  the 

quadratic  analysis  and  only  one  event  was  misclassif ied  in 

the  linear  analysis.  This  single  misclassif ication  occurred 

where  the  widest  variance  set  of  M  /m,  values  was  used.  In 

s  o 

all  cases  where  the  M  /m,  estimates  were  included,  the 

S  D 

average  posterior  probability  of  correct  classification  was 

found  to  be  greater  than  or  equal  to  0.  96. 

Application  of  cluster  analysis  methods  to  the  data 

failed  to  produce  homogeneous  groupings  at  the  two-cluster 

level,  both  in  the  case  where  the  seven  energy  ratios  were 

used  alone,  and  for  the  reduced  set  of  events  where  the 

M  / m.  ratios  were  included.  However,  where  the  M  / m, 
s  b  s  b 

estimates  were  used,  three  homogeneous  explosion  clusters 
and  one  earthquake  cluster  with  a  single  misplaced  explosion 
were  formed  at  the  four-cluster  level.  The  earthquake 
cluster  then  fused  with  an  explosion  cluster,  so  that 
homogeneous  clusters  did  not  persist  to  the  two-cluster 
level.  In  the  case  where  only  the  seven  energy  ratios  were 
used,  inhomogeneous  clusters  were  formed  at  the  earliest 
steps  in  the  clustering  process. 

Suggestions  have  been  made  for  improving  discrimination 
capabilities  based  on  recent  advances  in  seismic 
instrumentation  and  data  processing  techniques. 
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CHAPTER  I 


INTRODUCTION 

The  need  to  limit  the  testing  of  nuclear  devices  became 
evident  in  the  mid  1950's,  sparked  by  a  growing  concern  in 
the  scientific  community  over  the  increasing  amounts  of 
radioactivity  accumulating  in  the  biosphere.  Public 
awareness  began  to  focus  on  this  issue  in  1954,  after  the 
United  States  exploded  a  15  megaton  device,  Bravo,  in  an 
underwater  test  at  Eniwetok  Atoll  in  the  Bikini  Islands* 
Subsequent  fallout  from  this  test  had  been  much  greater  than 
anticipated  and  resulted  in  the  severe  exposure  of  a  number 
of  Marshall  Islanders  to  large  doses  of  radiation.  This 
sequence  of  events  reached  a  climax  when,  two  weeks  after 
the  explosion,  a  Japanese  fishing  vessel.  The  Lucky  Dragon, 
docked  at  Yaizu  Harbor  with  twenty  three  members  of  its  crew 
suffering  from  radiation  sickness  (Lapp,  1958).  These 
events,  combined  with  a  growing  scientific  awareness  of  the 
effect  of  radioactive  fallout  made  it  clear  that  some 
international  agreement  needed  to  be  reached  concerning  the 
future  of  nuclear  weapons  testing. 
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The  1958  Geneva  Conference  of  Experts 


The  first  international  political  action  to  evaluate 
the  consequences  of  nuclear  testing  and  to  study  ways  in 
which  a  realistic  test  ban  treaty  could  be  implemented  took 
place  at  Geneva  in  1958.  The  primary  task  of  this 
conference,  which  included  delegates  from  the  United  States 
and  the  Soviet  Bloc,  as  well  as  those  from  Britain,  France, 
and  Canada,  was  to  determine  whether  existing  methods  of 
detection  were  sufficient  to  ensure  a  test  ban  agreement. 
By  July  4,  1958,  an  agenda  had  been  adopted,  indicating  four 
areas  of  discussion.  These  were  (a)  an  exchange  of  opinions 
on  the  methods  for  detecting  nuclear  explosions,  (b) 
determination  of  methods  that  would  reliably  indicate  that 
an  explosion  has  taken  place,  (c)  agreement  on  a  system  for 
controlling  the  observance  of  a  test  ban  treaty,  and  (d) 
preparation  of  a  report  setting  forth  the  expert  conclusions 
and  recommendations  for  such  a  control  system  (Bolt,  1976). 

It  was  soon  agreed  that  existing  methods  were  adequate 
for  detecting  an  atmospheric  or  surface  test,  because  such 
an  explosion  generates  an  atmospheric  pressure  front  which 
is  easily  detected  on  microbarograph  recordings.  In 
addition,  radioactive  debris  and  electromagnetic 
disturbances  generated  by  a  nuclear  explosion  on  the  surface 
or  in  the  atmosphere  serve  as  reliable  indications  that  such 
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an  explosion  has  taken  place.  Prompted  by  these  results, 
the  delegates  to  the  conference  issued  the  following 
statement  (Bolt,  1976): 

"...  the  sensitivity  of  modern  physical, 
chemical,  and  geophysical  methods  of 
measurement  makes  it  possible  to  detect 
nuclear  explosions  at  considerable 
distances.  Thus,  it  is  known  that 
explosions  of  high  yield  which  are  set 
off  on  the  surface  of  the  earth  and  in 
the  lower  part  of  the  atmosphere  can  be 
detected  without  difficulty  at  points 
on  the  globe  that  are  very  remote  from 
the  site  of  the  explosion." 

On  the  other  hand,  agreement  on  suitable  methods  for 
detecting  underground  tests  appeared  to  be  much  more 
difficult  to  reach.  The  Soviet  delegation  was  confident 
that  existing  detection  methods  were  adequate  to  police  a 
test  ban  agreement;  however.  Western  delegates  did  not  share 
this  optimism.  The  debate  centered  on  the  determination  of 
the  minimum  magnitude  threshold  at  which  an  explosion  can  be 
detected.  The  Russian  opinion  held  that  the  vast  majority 
of  recorded  events  could  be  unambiguously  identified  as 
earthquakes.  For  example,  deep  shocks  of  magnitude  5.0  or 
greater  could  be  excluded,  as  could  events  occurring  near 
control  stations,  populated  areas,  and  at  great  depths  under 
the  ocean.  This  would  leave  only  a  very  small  number  of 
questionable  events,  perhaps  five  or  ten  in  a  year,  which 
would  require  further  investigation. 

The  British  position,  presented  by  Sir  Edward  Bullard, 
questioned  this  optimism.  Bullard  asserted  that  the  Russian 
arguments  were  based  on  data  obtained  under  ideal 
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conditions,  whereas  an  actual  control  network  would  be 
required  to  operate  under  realistic  conditions  of  noise,  and 
certainly  without  forewarning  of  impending  tests.  Bullard 
further  suggested  that  magnitude/yield  detection  thresholds 
might  be  reduced  by  incorporating  the  polarity  of  first 
motion  as  a  discriminant  criterion.  This  criterion  became 
quite  attractive  to  the  Geneva  delegates  because  it  was 
believed  to  represent  a  positive  identifier  of  explosions. 
That  is,  a  certain  pattern  of  first  motions,  one  that  is 
compressional  at  all  stations,  is  both  a  necessary  and 
sufficient  characteristic  to  indicate  that  the  event  in 
question  is  an  explosion.  The  first  motion  polarity 
criterion  led  to  a  great  deal  of  optimism  among  both  Eastern 
and  Western  delegates,  who  felt  that  90  percent  of  all 
continental  earthquakes  could  be  reliably  distinguished  from 
explosions  on  the  basis  of  the  first  motion  criterion  alone. 

Several  other  possible  approaches  to  discrimination 
were  also  discussed.  Most  earthquakes  are  found  to  occur  at 
depths  appreciably  greater  than  five  to  eight  kilometers, 
generally  considered  to  be  the  limit  to  which  an  emplacement 
hole  for  a  nuclear  device  can  be  drilled.  The  Geneva 
delegates  agreed  that  the  focal  depth  could  be  reliably 
determined  using  the  pP-P  interval  time.  Under  this  plan, 
if  an  event  was  found  to  have  a  focal  depth  less  than  some 
threshold  depth,  say  ten  kilometers,  it  would  be  classified 
as  a  suspicious  event.  Similarly,  the  location  process 
itself  was  suggested  as  a  discrimination  technique,  where 
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events  occurring  in  an  aseismic  region  would  be  considered 
suspicious,  and  chose  occurring  near  population  centers,  or 
under  deep  oceans  would  be  classified  as  natural  events. 

The  third  discrimination  technique  considered  at  Geneva 
is  based  on  theoretical  differences  in  the  partitioning  of 
energy  between  compressional  and  shear  wave  radiation 
fields.  The  source  function  for  an  underground  explosion 
may  be  approximated  by  a  pressure  pulse  applied  to  the 
surface  of  a  spherical  cavity,  centered  about  the  detonation 
point  of  the  explosion.  This  type  of  impulsive,  radially 
symmetric  source  generates  only  compressional  waves  in  an 
infinite  medium;  however,  boundary  interactions  occurring  in 
a  halfspace  or  a  multilayered  medium  result  in  the 
generation  of  Rayleigh  surface  waves  and  in  the  conversion 
of  some  of  the  P*wave  energy  into  SV*waves  at  the 
boundaries.  The  earthquake  source,  on  the  other  hand,  is  a 
faulting  mechanism  which  may  be  modelled  by  a  series  of 
double  couples.  This  type  of  source  entails  release  of 
shear  strain  energy  which  results  in  a  significant 
partitioning  of  energy  into  shear  wave  radiation.  In 
addition,  where  the  propagating  medium  may  be  represented  by 
one  or  more  layers  over  a  halfspace,  Love  surface  waves  may 
be  generated  because  of  the  horizontally  polarized  (SH) 
shear  wave  components  in  the  earthquake  radiation  field. 

Based  on  these  results,  the  seismic  signature  of  an 
underground  nuclear  explosion  should  contain  a  large 
fraction  of  compressional  wave  energy  and  no  SH  or  Love  wave 


energy,  whereas  the  signature  for  an  earthquake  should 

contain  a  larger  amount  of  both  SV  and  SH  shear  wave  energy, 

and  both  L^ve  and  Rayleigh  surface  wave  energy.  This 

fundamental  difference  in  energy  partitioning  at  the  source 

forms  the  basis  for  the  M  /m,  discriminant,  and  for  the 

s  b 

cluster  and  discriminant  analysis  schemes  presented  in  this 
paper . 


The  Geneva  Control  Scheme 

After  considering  the  various  diagnostic  techniques  to 
be  applied  to  the  discrimination  problem,  the  delegates  to 
the  Geneva  conference  turned  their  attention  to  the  question 
of  a  suitable  worldwide  control  system  to  monitor  a 
subsequent  treaty  agreement.  The  Russian  proposal  consisted 
of  a  network  of  approximately  100  acoustic  monitoring 
stations  to  detect  surface  and  atmospheric  blasts,  coupled 
with  existing  national  seismographic  stations  to  be  used  for 
underground  detection  purposes.  Such  a  system,  according  to 
the  Soviet  delegation,  would  have  the  capability  of 
detecting  underground  and  above-ground  detonations  to  a  one 
kiloton  threshold  yield  (Bolt,  1976).  Objections  to  this 
proposal  were  voiced  by  Western  delegates,  who  pointed  out 
that  many  of  the  national  stations  were  untrustworthy,  in 
that  they  were  poorly  equipped,  and  only  spottily  staffed. 


At  that  time  a  wide  variety  of  equipment  was  in  use,  with 
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different  frequency  bandwidths  and  instrument  response 
characteristics.  In  particular,  very  few  stations  had 
accurate  means  for  time  calibration,  so  that  the  variance  in 
origin  time  as  determined  by  such  a  network  might  be  as 
great  as  several  minutes. 

The  United  States'  plan  for  identifying  explosions  at 
the  one  kiloton  yield  threshold  based  on  first  motion 
polarity  required  a  total  of  650  stations  world-wide.  This 
proposal,  in  turn,  proved  untenable  to  the  Eastern  side,  so 
that  a  deadlock  ensued  until  a  compromise  proposal  was 
suggested*  British  delegate  William  Penny  proposed  that  the 
minimum  detection  yield  threshold  be  raised  from  one  kiloton 
to  five  kilotons,  which  would  allow  for  a  reduction  in  the 
required  number  of  monitoring  stations  from  650  to  170. 
This  suggestion  evolved  to  form  the  basis  for  the  Geneva 
Seismic  Detection  Network. 

The  so-called  Geneva  Network  stands  as  a  landmark  for 
seismology;  although  it  was  never  put  into  operation,  it  was 
to  become  the  basis  for  a  standardized  global  network.  In 
all,  170  stations  were  to  be  located  in  North  America, 
Europe  and  Asia,  with  some  stations  placed  on  islands  in  the 
Pacific  Ocean  and  Carribbean  Sea.  In  addition  to  the  land- 
based  stations,  10  acoustic  monitoring  stations  were  to  be 
placed  on  ships.  The  standardized  equipment  was  to  consist 
of  two  short-period  horizontal  and  ten  short-period  vertical 
instruments  distributed  over  two  or  three  kilometers  in  a 
small  aperture  array.  Long-period  instruments  were  also  to 
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be  included  at  each  station,  with  hydroacoustic  equipment 
included  at  the  island  stations  (Bolt,  1976). 

One  of  the  major  factors  preventing  the  implementation 
of  a  comprehensive  test  ban  treaty  is  the  question  of  on¬ 
site  inspection.  At  the  1958  Geneva  Conference,  this  was  a 
key  provision  for  the  West,  that  the  sites  of  so-called 
"residue  events",  that  is,  events  which  could  not  be 
unambiguously  identified  as  natural  earthquakes,  be  open  to 
inspection  by  all  concerned  parties.  This  proposal  proved 
to  be  most  unpalatable  to  the  Eastern  delegation,  and 
continues  to  be  a  major  stumbling  block,  even  in  present 
negotiations . 

Because  of  this  inability  to  reach  agreement  on  the 
fundamental  question  of  on-site  inspection,  and  in  light  of 
some  rather  disturbing  developments  revealed  in  the  October, 
1958  Hardtack  II  tests,  plans  for  a  comprehensive  test  ban 
treaty  were  temporarily  suspended,  and  the  Geneva  conference 
adjourned.  Upon  examination  of  the  records  from  Hardtack 
II,  it  was  found  that  several  stations  had  recorded  the 
earlier  Rainier  blast,  the  model  used  in  the  Geneva  talks, 
with  an  anomalously  high  amplitude.  This  led  to  a  downward 
revision  of  the  magnitude  for  Rainier  from  4.6  to  4.1. 
Because  so  many  more  earthquakes  occur  at  this  smaller 
magnitude  each  year,  nearly  twice  as  many  events  would  have 
to  be  examined  for  a  given  magnitude  yield  threshold.  In 
addition,  it  was  found  that  the  criterion  of  first  motion 
polarity,  then  thought  to  be  the  only  positive 


identification  criterion  available,  was  reliable  only  at 


noise  levels  much  lower  than  was  previously  thought 
necessary . 

These  discoveries  brought  home  the  need  for  massive 
improvements  in  seismic  techniques.  In  the  United  States, 
new  funding  was  provided  for  seismic  improvement  under 
project  Vela  Uniform,  and  similar  research  and  development 
operations  were  established  abroad,  perhaps  most  notably  the 
continuing  research  effort  at  Blacknest  in  Great  Britain. 

Under  project  Vela  Uniform,  initiated  by  the  Advanced 
Research  Projects  Agency  of  the  Department  of  Defense,  250 
million  dollars  were  spent  to  improve  seismic  capability  in 
the  United  States  and  abroad  between  1960  and  1971.  Some  of 
this  funding  was  directed  toward  seismological  research 
although  much  of  it  was  required  to  upgrade  detection 
capabilities  and  instrumentation.  For  example,  it  was 
during  this  period  that  the  World  Wide  Standard  Seismograph 
Network  (WWSSN)  was  established,  as  well  as  numerous  seismic 
arrays,  such  as  the  Large  Aperture  Seismic  Array  (LASA),  the 
Alaskan  Long  Period  Seismic  Array  (ALPA),  and  the  Norwegian 
Seismic  Array  (NORSAR). 


Further  Negotiations  and  Treaties 


Between  1958  and  1965,  considerable  progress  had  been 
achieved  in  detection  of  blasts  in  the  atmosphere  and 
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underwater.  In  these  instances,  the  problem  is  considerably 
simplified  by  the  fact  that  the  detonation  medium  is  an 
acoustic  medium,  without  most  of  the  complex  propagation 
characteristics  of  the  solid  earth.  Particularly  important 
is  the  fact  that  neither  the  atmosphere  nor  the  oceans  have 
a  competing  background  of  natural  events  of  similar 
character  and  magnitude. 

The  reverse  is  true  in  the  case  of  seismological 
detection  of  underground  explosions,  where  a  perpetual 
background  of  natural  seismicity  is  present,  so  that  an 
effective  method  for  differentiating  between  an  underground 
explosion  and  a  natural  earthquake  is  required. 

In  light  of  these  results,  a  limited  test  ban  treaty 
was  developed  through  negotiations  at  Geneva  in  1963.  In 
effect,  the  treaty  bans  all  atmospheric  and  underwater 
testing,  and  severely  resticts  surface  detonations  by 
requiring  that  radioactive  debris  be  contained  entirely 
within  the  territorial  limits  of  the  nation  in  which  the 
explosion  is  detonated. 

Weapons  testing  has  been  further  restricted  in 
subsequent  treaties  banning  tests  in  Antarctica  (1959),  the 
Treaty  on  Principles  Governing  the  Activities  of  States  in 
the  Exploration  and  Use  of  Outer  Space  (1967),  and  the 
Treaty  on  the  Non-Proliferation  of  Nuclear  Weapons  (1968). 
However,  underground  testing  has  continued  unabated  since 
the  ratification  of  the  Limited  Test  Ban  Treaty  in  1963. 

In  1974,  India  became  the  sixth  nuclear  power  (the 


other  five  are  the  United  States,  the  Soviet  Union,  Great 
Britain,  France,  and  the  Peoples'  Republic  of  China).  This 
event  reawakened  concern  over  the  proliferation  of  nuclear 
weapons  and  renewed  the  apparently  long  dormant 
international  interest  in  banning,  or  at  least  limiting,  the 
underground  testing  of  nuclear  weapons.  As  a  direct 
consequence  of  the  Indian  nuclear  test,  the  Threshold  Test 
Ban  Treaty  was  drafted  in  1974  .  This  treaty  was  initialled 
by  both  the  United  States  and  the  Soviet  Union,  and  although 
the  treaty  was  never  brought  before  the  United  States  Senate 
for  ratification,  both  sides  informally  agreed  to  abide  by 
its  provisions.  In  essence,  the  1974  Limited  Test  Ban 
Treaty  stipulates  that  underground  nuclear  testing  be 
limited  to  devices  of  150  kiloton  yield  or  less,  well  above 
the  threshold  detection  and  identification  capabilities  of 
either  side. 

With  the  continued  Soviet  opposition  to  on-site 
inspection,  most  research  since  1958  has  focused  on  those 
detection  and  discrimination  criteria  which  are  applicable 
at  teleseismic  distances.  However,  recent  talks  in  Geneva 
indicate  the  possibility  that  a  comprehensive  test  ban 
treaty  may  be  drafted  in  the  near  future.  This  has  been 
brought  about  by  the  apparent  willingness  of  each  side  to 
allow  a  certain  number  of  stations  to  be  installed  and 
operated  within  its  national  borders  by  the  other.  Given 
this  possibility  of  a  network  of  internal  monitoring 
stations,  it  has  become  imperative  to  develop  regional 
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discrimination  techniques  to  complement  current  teleseismic 
methods  and  better  ensure  compliance  with  a  test  ban  treaty. 
Research  continues  on  seismic  discrimination,  and  regional 
techniques  which  have  received  relatively  little  attention 
in  the  past  are  now  quickly  growing  in  importance, 
especially  in  light  of  the  recent  advances  in  seismic 
instrumentation.  A  primary  example  is  the  recently 
developed  Seismic  Research  Observatory  (SRO)  Network,  where 
data  is  automatically  digitized  and  stored  on  magnetic  tape. 
Three  outstanding  features  of  this  network  are  its  wide 
dynamic  range,  excellent  signal-to-noise  characteristics, 
and  on-site  digital  recording  which  allows  the  data  to  be 
routinely  subjected  to  sophisticated  digital  analyses. 

The  remainder  of  this  paper  is  devoted  to  describing 
various  regional  discriminant  criteria  and  to  illustrating 
several  practical  techniques  by  which  these  criteria  may  be 
employed.  Among  these  techniques  are  multivariate  linear 
and  quadratic  discriminant  analysis,  and  several  types  of 
cluster  analysis.  These  techniques  are  applied  to  the  set 
of  energy  ratios  measured  on  a  group  of  earthquakes  and 
underground  nuclear  explosions  examined  originally  by  Booker 
and  Mitronovas  (1964).  This  data  set  is  augmented  with 
estimated  Mg/m^  ratios  which  have  been  generated  in  such  a 
way  as  to  reflect  the  uncertainties  typical  of  western 
United  States  earthquakes  and  Nevada  Test  Site  explosions. 

The  last  part  of  this  paper  contains  the  description  of 
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a  general  spectral  analysis  program  which  may  be  used  to 
calculate  spectral  discriminants  based  on  the  event's 
spectrogram,  a  two-dimensional  contour  display  of  power 
spectral  density  as  a  function  of  both  time  (or  velocity) 
and  frequency.  Alternatively,  the  program  may  be  used  to 
generate  discriminants  based  on  the  spectra  of  time  or 
velocity  windows  choaen  at  any  position  within  the  record- 
An  interesting  feature  of  this  program  is  that  spectra  for 
long  windows  are  calculated  by  averaging  the  spectra  of  the 
shorter  windows  used  to  obtain  the  event  spectrogram,  so 
that  only  one  set  of  power  spectra  need  be  calculated.  When 
coupled  with  the  digital  recording  capabilities  of  the  SRO 
Network,  criteria  such  as  the  spectral  and  energy  ratios 
described  above  may  be  routinely  and  economically  applied. 
This  approach  provides  a  practical  means  of  using  the  entire 
seismic  signature  in  a  quantitative  manner  for  improved 
discrimination  capabilities. 


CHAPTER  II 


A  BACKGROUND  FOR  THE  DISCRIMINANTS 
USED  IN  THIS  STUDY 


The  discriminants  which  might  be  employed  in  an 
earthquake-underground  nuclear  explosion  discrimination 
scheme  may  be  divided  into  two  groups*  The  first  group 
consists  of  those  discriminants  which  indicate  the  origin  of 
a  seismic  event  on  the  basis  of  geographic  considerations* 
For  example,  an  event  located  at  great  depth  or  close  to  a 
major  population  center  may  be  identified  as  a  natural 
earthquake  with  a  high  degree  of  certainty.  On  the  other 
hand,  a  shallow  event  occurring  in  a  comparatively  aseismic 
region  may  be  regarded  as  suspicious.  Where  adequate 
station  coverage  is  available  so  that  accurate  seismic 
location  and  depth  determinations  may  be  routinely  obtained 
by  seismological  cataloging  agencies,  these  geographic 
discriminants  may  be  readily  utilized  to  classify  the 
majority  of  recorded  events. 

Those  events  which  cannot  be  classified  using  simple 
geographic  criteria  must  then  be  subjected  to  more 
sophisticated  analyses,  based  on  theoretical  or  empirical 
differences  between  the  earthquake  and  underground  explosion 
source  characteristics.  Factors  contributing  to  these 


differences  are  Che  rupture  mechanism,  the  fault  dimensions 


and  geometry,  and  the  source  time  history.  Considerable 
effort  has  been  made  In  recent  years  to  model  the  earthquake 
and  explosion  source  mechanisms  on  the  basis  of  observed 
radiation  patterns  and  to  justify  these  models  physically. 
As  a  result  of  this  research,  numerous  discriminants  have 
been  developed  to  differentiate  between  earthquakes  and 
underground  explosions  on  the  basis  of  the  observable 
features  of  their  respective  radiation  fields. 


A  Brief  Survey  of  Earthquake  and  Underground  Explosion 

Source  Theory 

One  of  the  most  complex  and  difficult  problems  in 
seismology  is  the  development  of  a  source  theory  to  explain 
in  detail  the  features  of  the  observed  seismic  signatures  o r 
earthquakes  and  underground  explosions.  Several  hypotheses 
have  been  advanced  concerning  the  nature  of  the  seismic 
focal  mechanism,  some  of  which  are  quite  successful  in 
explaining  major  characteristics  of  the  observed  radiation 
fields. 

One  of  the  earliest  attempts  to  explain  the  earthquake 
source  mechanism  is  the  elastic  rebound  theory  of  fracture 
formulated  by  Reid  (1933).  Based  on  observations  of  the  1906 
San  Francisco  earthquake,  Reid  concluded  that  an  earthquake 
occurs  as  the  result  of  the  accumulation  of  elastic  stresses 
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within  a  given  region  of  the  Earth.  These  stresses  slowly 
increase  with  time  until  they  exceed  the  strength  of  the 
medium,  at  which  point  the  accumulated  stresses  are  relieved 
by  fracture.  This  theory  may  be  summarized  in  the  following 
statements  (Stacey,  1969): 


1.  The  earthquake  is  produced  by  fracture  which  is  the 
result  of  elastic  strains  that  exceed  the  strength  of 
the  material.  These  strains  are  produced  by  relative 
displacements  of  neighboring  portions  of  the  Earth's 
crus  t . 

2.  The  relative  displacements  are  not  produced  suddenly  at 

the  time  of  fracture,  but  accumulate  slowly  over  a 

period  of  time. 

3.  The  only  mass  movements  at  the  time  of  fracture  are  due 

to  the  sudden  elastic  rebound  of  the  sides  of  the 

fracture  toward  equilibrium.  These  displacements  extend 
only  a  few  kilometers  in  a  direction  normal  to  the 

fracture  surface. 

4.  Elastic  waves  propagate  from  the  sides  of  the  fracture, 
the  surface  of  which  has,  at  first,  a  small  area.  This 
area  may  become  large,  but  not  at  a  rate  greater  than 
the  compr essional  wave  velocity  of  the  material  (Note: 
There  has  been  some  debate  in  recent  years  concerning 
the  speed  of  rupture  propagation.  Some  authors  (Press, 
et  al.,  1961)  believe  the  rupture  velocity  to  be  less 
than  the  S-wave  velocity  of  the  medium.) 

5.  The  energy  liberated  in  an  earthquake  is  stored  prior  to 
fracture  in  the  form  of  elastic  strain  energy. 


By  examination  of  the  distribution  of  first  motion 
polarities,  it  was  found  that  identical  first  motion 
patterns  could  be  obtained  by  representing  the  source  in 
terms  of  an  equivalent  point  source  (Stauder,  1962;  Hodgson 
and  Stevens,  1964;  Ke il is-Bo r ok ,  1960).  Honda  (1961) 
considered  both  the  single  couple  and  the  double  couple 
point  source  representations,  referring  to  them  as  the  type 
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I  and  type  II  sources,  respectively.  He  found  that  the 
double  couple  representation  is  equivalent  to  a  system  of 
forces  acting  within  a  spherical  volume  element  where  the 
net  moment  is  balanced. 

Both  the  single  and  double  couple  point  source 
representations  result  in  an  identical  pattern  of  P-wave 
first  motions  polarities.  However,  the  S-wave  radiation 
pattern  resulting  from  a  single  couple  source  is  twin-lobed, 
whereas  that  resulting  from  the  double  couple  source  is 
four-lobed.  Studies  of  earthquake  first  motion  radiation 
patterns  (Stauder,  1962;  Stevens,  1969)  indicate  that  the 
double  couple  point  source  is  generally  the  more  accurate 
representation  for  the  earthquake  source.  Moreover,  the 
single  couple  point  source  entails  a  net  moment  which 
implies  a  rotation  of  the  fault  surface.  This  phenomenon  is 
rarely  observed  for  shallow  events;  however,  Chinnery  (1960) 
has  suggested  that  plasticity  effects  might  account  for 
single  couple  type  radiation  from  deeper  events. 

Ben-Menahem  (1961)  studied  surface  wave  excitation 
using  a  finite  line  distribution  of  directed  point  sources, 
and  simulated  the  effect  of  a  propagating  fracture  in  a 
half-space  by  integrating  the  elementary  line  source 
solutions  assuming  a  finite  rupture  velocity.  Harkrider 
(1964)  and  Ben-Menahem  and  Harkrider  (1964)  studied  surface 
wave  excitation  in  a  layered  earth  model,  using  the  directed 
point  source  to  form  couples,  double  couples,  and  higher 
order  multipole  source  terms. 
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The  rupture  mechanism  involved  in  the  elastic  rebound 
theory  was  originally  thought  to  be  that  of  simple  Coulomb 
fracture.  However,  Coulomb  fracture  provides  a  plausible 
explanation  only  in  the  case  of  very  shallow  earthquakes. 
Because  the  coefficient  of  internal  friction  for  most  rock 
materials  is  on  the  order  of  unity,  frictional  sliding  may 
occur  only  if  the  shearing  stress  is  of  the  same  order  of 
magnitude  as  the  normal  stress  (roughly  equivalent  to  the 
overburden  pressure).  However,  the  overburden  pressure 
increases  on  the  average  by  about  300  bars  per  kilometer  in 
depth,  whereas  the  shear  strength  in  the  crust  is  thought  to 
be  on  the  order  of  a  few  hundred  bars  in  magnitude,  and  even 
less  in  the  upper  mantle  (Stacey,  1969).  Thus  at  depth, 
accumulation  of  shear  stress  should  result  in  bodily 
deformation  (creep)  rather  than  frictional  sliding.  This 
argument  is  nullified  somewhat  by  the  introduction  of  large 
hydrostatic  pore  pressures,  such  as  those  resulting  from 
metamorphic  processes,  which  reduce  the  effective  normal 
stress  across  the  fracture  surface.  It  is  thought,  however, 
that  even  this  modified  Coulomb  fracture  is  not  a  major 
contributing  factor  at  depths  greater  than  25  kilometers 
(Stacey,  1969). 

Attempts  to  explain  the  physics  of  the  rupture 
phenomenon  have  resulted  in  two  fairly  successful  earthquake 
models.  Steketee  (1958)  demonstrated  the  applicability  of 
elastic  dislocation  theory  to  the  earthquake  source  problem. 
In  this  case,  rupture  formation  is  equivalent  to  the 


instantaneous  removal  of  stresses  across  the  fault  surface, 
which  is  also  equivalent  to  the  sudden  application  of 
stresses  in  an  unstressed  medium  (Turnbull,  1  9  7  6  )  • 

Equivalent  point  sources  for  the  dislocation  model  have 
been  considered  by  Chinnery  (1960,1961),  Knopoff  and  Gilbert 
(I960),  and  Burridge  and  Knopoff  (1964).  Based  on 
equilibrium  considerations,  Chinnery  (1960)  concluded  that 
the  double  couple  representation  is  the  most  likely  for 
near-suface  faults.  In  the  case  of  the  single  couple,  the 
presence  of  an  unbalanced  moment  implies  that  the  medium 
readjusts  to  equilibrium  at  some  time  after  the  first 
emission  of  seismic  waves.  Chinnery  (1960)  suggested  that 
such  a  situation  might  occur  at  depth  if  plasticity  effects 
in  the  rupture  zone  are  taken  into  account.  Knopoff  and 
Gilbert  (1960)  modelled  the  earthquake  fault  plane  in  terms 
of  a  geometric  discontinuity  in  strain  or  displacement,  and 
derived  the  double  couple  representation  for  the 
displacement  discontinuity.  They  also  found  that  a 
discontinuity  in  shear  strain  is  equivalent  to  a  directed 
point  force  representation,  and  that  shear  strain  release  in 
a  laminar  region  (en  echelon  faulting)  may  be  represented  by 
the  single  couple  source. 

The  point  source  dislocation  model  has  been  used  to 
estimate  the  fault  length,  stress  drop,  and  seismic  moment 
from  observed  radiation  fields  based  on  theoretical  results 
obtained  by  Brune  (1970),  and  Hanks  and  Wyss  (1972). 

Archambeau  (1968)  has  developed  a  rather  formidable 
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source  theory  where  the  rupture  process  is  modelled  in  terms 
of  a  volume  relaxation  phenomenon,  described  mathematically 
as  an  initial  value  problem.  In  this  theory,  the  dynamical 
properties  of  the  source  are  described  in  terms  of  the 
relaxation  or  readjustment  of  pre-existing  stress  fields. 
The  initial  shear  stress  field  is  taken  to  be  the  primary 
cause  of  rupture.  Seismic  effects  are  the  result  of  the 
relaxation  of  this  prestress  shear  field  in  the  medium 
surrounding  a  region  whose  physical  properties,  rigidity  in 
particular,  have  changed  suddenly. 

The  tectonic  source  is  viewed  in  terms  of  the  release 
of  potential  strain  energy  from  within  a  nonelastic  zone  in 
the  medium.  Within  this  nonlinear  rupture  zone,  stress 
release  is  achieved  by  flow,  fracture,  or  phase  change,  so 
that  strain  energy  within  the  rupture  volume  is  dissipated 
in  the  work  of  nonelastic  deformation. 

This  dynamic  relaxation  model  requires  that  the  stress 
field  surrounding  the  rupture  zone  readjust  to  equilibrium 
in  a  fashion  determined  by  the  boundary  conditions  on  the 
rupture  surface  once  rupture  is  initiated.  This 
readjustment  is  accomplished  by  seismic  radiation  wherever 
the  medium  is  elastic. 

The  radiation  field  may  be  characterized  as  a  multipole 
expansion.  For  separable  source-time  functions,  the 
multipole  coefficients  are  dependent  only  upon  source 
geometry.  For  nonseparable  source-time  functions,  however, 
the  multipole  coefficients  are  frequency  dependent 


(Turnbull,  1976). 

A  property  of  the  earthquake  source  models  described 
above  which  is  of  central  importance  to  the 
earthquake/underground  explosion  discrimination  problem  is 
that  the  earthquake  source  may  be  described  as  a  shearing 
mechanism.  This  leads  to  the  result  that  the  radiation 
field  may  be  characterized  in  terms  of  dipolar  and  higher 
order  multipole  contributions  (Turnbull,  1976).  Where  this 
is  the  case,  a  significant  portion  of  the  radiated  energy  is 
propagated  in  the  form  of  shear  (SH  and  SV)  body  waves  and 
Love  and  Rayleigh  surface  waves.  In  comparison,  the 
explosive  source,  represented  as  a  point  dilatational 
source,  generates  only  compressional  wave  radiation  at  the 
source. 

The  source  mechanism  corresponding  to  an  underground 
explosion  has  been  described  by  Sharpe  (1946)  and  Carpenter 
(1967).  Schematically,  the  explosion  source  mechanism  may 
be  described  as  follows:  At  the  instant  of  detonation, 
material  immediately  surrounding  the  detonation  point  is 
vaporized.  A  shock  front  proceeds  radially  outward. 
Because  the  initial  stresses  associated  with  this  shock  wave 
greatly  exceed  the  material  strength  of  the  containing 
medium,  there  is  a  zone  of  fracture  where  energy  is 
dissipated  in  the  work  of  nonelastic  deformation.  At  some 
radial  distance  from  the  detonation  point,  sufficient  energy 
has  been  dissipated  for  the  laws  of  linear  elastic 
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deformation  to  apply.  At  this  point,  energy  is  dissipated 
in  the  form  of  seismic  radiation.  The  underground  explosion 
source  has  been  modelled  in  terms  of  an  outward  directed 
pressure  pulse  applied  to  the  surface  of  a  spherical  cavity, 
which  is  equivalent  to  a  radially  symmetric  point  dilational 
source.  In  an  infinite  medium,  the  expected  radiation  field 
from  a  cavity  source  is  purely  compress ional .  In  a  layered 
halfspace,  boundary  effects  result  in  some  conversion  of  the 
P  wave  radiation  to  shear  (SV)  body  waves  and  Rayleigh 
surface  waves.  However,  Gilbert  (1973),  and  Douglas, 
Hudson,  and  Kembhavi  (1971)  have  shown  that  the  partitioning 
of  energy  into  the  shear  wave  radiation  field  at  the  source 
results  in  a  greater  excitation  of  surface  waves  by  the 
earthquake  source.  Moreover,  discounting  the  effects  of 
non-linear  mode  conversion,  no  SH  shear  body  waves  or  Love 
surface  waves  are  generated  by  the  explosive  source. 

The  theoretical  seismic  radiation  patterns  for  a 
spherical  cavity  source  assume  a  purely  monopole 
distribution.  However,  a  significant  quadrupole  contribution 
has  been  observed  in  the  SH-wave  and  Love  wave  radiation 
fields  for  many  explosions  (Archambeau  and  Sammis,  1970; 
Archambeau,  1972).  Archambeau  (1972)  suggested  that  this 
anomalous  radiation  field  may  be  contributed  by  a 
combination  of  stress  relaxation  accompanying  the  creation 
of  a  fracture  zone  in  a  prestressed  medium,  and  by  weak 
zones  and  faulting  in  the  immediate  vicinity  of  the 
detonation  point,  leading  to  secondary  ruptures  with 
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definite  fault  symmetries.  Radiation  caused  by  the  first 
mechanism  is  quadrupolar  in  nature,  whereas  that  from  the 
second  exhibits  dipolar  and  higher  multipole  contributions. 
This  technique  has  been  applied  by  Archambeau  and  Sammis 
(1970)  to  obtain  source  parameters  from  the  Bilby 
underground  explosion. 


Discriminants  Based  on  Earthquake /Underground  Explosion 

Source  Differences 


Several  studies  have  been  made  where  the  expected 
displacement  amplitudes  for  the  earthquake  source  have  been 
compared  with  those  from  a  buried  explosive  source. 
Douglas,  Hudson  and  Kembhavi  (1971)  studied  the  relative 
excitation  of  body  and  surface  waves  for  both  earthquake  and 
underground  explosion  point  source  models.  They  concluded 
that  the  earthquake  source,  represented  as  a  point  double 
couple,  is  a  significantly  more  efficient  generator  of  shear 
body  wave  and  Rayleigh  surface  wave  radiation  than  the 
explosive  source,  modelled  as  a  point  dilatational  source. 
A  similar  result  has  been  obtained  by  Gilbert  (1970,  1973). 

Numerous  discriminants  have  been  developed  to 
differentiate  between  earthquakes  and  underground 
explosions.  In  addition  to  the  depth,  geographic,  and  first 
motion  discriminants  already  mentioned,  several 
discriminants  have  been  proposed  which  are  based  upon 
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differences  in  the  relative  excitation  of  body  and  surface 
waves  between  the  earthquake  source  and  the  explosion 
source • 

A  discriminant  involving  the  difference  between 

long-period  surface  wave  magnitude  (Mg)  and  short-period 

body  wave  magnitude  ( )  was  originally  suggested  by  Press, 

et  al,  (1963)*  Subsequent  research  has  shown  the  M  -m. 

s  b 

discriminant  to  be  one  of  the  most  effective  teleseismic 
discriminants  available  (Thirlaway,  1968;  SIPRI,  1968; 
Basham,  1969;  Liebermann  and  Pomeroy,  1969;  Capon,  et  al, 
1969;  Evernden,  1969).  For  events  recorded  at  teleseismic 
distances,  M^-m^  reliably  separates  earthquakes  and 
underground  explosions  for  m,  values  as  low  as  5.0 

D 

(Thirlaway,  1968;  SIPRI,  1968)  or  4.75  (Evernden,  1969). 
Recent  studies  also  show  the  need  for  regionalization  of 
Mg-mb  to  account  for  differing  attenuation  characteristics 
of  various  source-receiver  paths  (Marshall  and  Basham, 
1972  )  . 

Frequency  domain  studies  have  been  used  to  suggest 
several  possible  spectral  discriminants  (Aki,  1967;  Brune, 
1970;  Wyss  et  al.,  1971;  Hasegawa,  1972;  Alexander  and 
Lambert,  1973).  Ir  essence,  these  discriminants  reflect  the 
greater  amount  of  low  frequency  energy  expected  in 
earthquake  spectra.  This  is  a  result  of  differences  in 
source  dimensions  and  source  time  histories  between 
earthquakes  and  underground  explosions. 


Other  discriminants  have  been  suggested,  such  as  those 
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based  upon  the  greater  complexity  of  the  earthquake 
signature  (SIPRI,  1968),  and  related  discriminants  suggested 
by  autoregressive  modelling  studies  applied  to  the 
earthquake  and  explosion  signatures  (Tjostheira,  1975). 

The  discriminants  used  for  the  analysis  presented  in 
this  paper  are  chosen  to  reflect  the  theoretical  differences 
in  energy  partitioning  between  the  compress ional  and  shear 
wave  radiation  fields  for  earthquakes  and  underground 
explosions.  These  discriminants  consist  of  a  set  of  energy 
ratios  first  studied  by  Booker  and  Mitronovas  (1964),  and 
estimated  M  / m,  ratios.  M  / m,  ratios  were  used  instead  of 
Ms-m^  differences  because  the  Mg/m^  ratio  is  formally 
consistent  with  the  Booker  and  Mitronovas  energy  ratio 
discriminants. 

Assuming  a  linear  relationship  between  Mg  and  m^  of  the 

form  Mg=Am^+B  ,  which  is  valid  for  a  given  range  of 

values,  the  Mg-m^  discriminant  parameterizes  the  slope  A 

upon  m^ ,  whereas  the  Mg/m^  discriminant  parameterizes  the 

constant  B  upon  m,  .  Because  the  m,  values  used  to  form  the 

d  b 

Mg/m^  discriminant  in  this  study  vary  over  a  reasonably 

small  range  of  m^  which  is  nearly  the  same  for  both 

populations,  and  because  the  slopes  for  both  populations  are 

nearly  unity,  the  M  / m.  discriminant  yields  results  which  do 

rot  differ  significantly  from  those  obtained  using  the  M  -m, 

s  b 

discriminant.  F-statistics  for  both  cases  are  given  in 


Table  (V-l) 
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CHAPTER  III 


MULTIVARIATE  CLASSIFICATION  TECHNIQUES 


Once  a  suitable  set  of  discriminants  has  been  obtained, 
several  methods  may  be  used  to  incorporate  the  set  into  a 
multivariate  classification  scheme.  Two  techniques  are 
discussed  in  this  paper. 

The  first  involves  multiple  discriminant  analysis 
theory,  where  an  event  is  classified  according  to  the 
magnitude  of  a  likelihood  function  derived  from  a  Gaussian 
probabilistic  model.  Both  linear  and  quadratic 
classification  schemes  are  discussed. 

The  second  technique  is  based  on  several  cluster 
analysis  procedures.  The  cluster  analysis  approach  differs 
fundamentally  from  that  of  discriminant  anlysis,  in  that  no 
a  priori  probabilistic  model  is  assumed.  Instead,  groups, 
or  clusters,  are  allowed  to  form  simply  on  the  basis  of  the 
optimization  of  a  specified  similarity  criterion.  The 
immediate  attraction  of  this  non-parametr ic  approach  is 
offset,  however,  by  the  relatively  poor  performance  of 


clustering  techniques  compared  to  that  of  the  discriminant 
analysis  procedures  described  below. 


1 
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Multiple  Discriminant  Analysis ;  Theory 

In  this  section,  the  problem  of  constructing  a  method 
for  grouping  or  identifying  similar  events  is  discussed  in 
terms  of  multiple  discriminant  analysis.  This  method 
incorporates  a  probabilistic  interpretation  to  the  problem, 
in  that  the  event  is  considered  to  be  a  random  observation 
from  one  of  several  different  possible  stochastic 
populations.  The  desired  objective  is  to  identify  that 
population  from  which  the  observation  originates.  This  is 
accomplished  by  constructing  and  testing  the  hypotheses,  H^, 
that  the  observation  originates  from  the  ith  population. 

The  hypothesis  testing  may  be  carried  out  according  to 
one  of  several  criteria  of  optimality.  All  of  the  criteria 
considered  lead  to  a  simple  test  involving  comparisons  of 
the  magnitudes  of  the  likelihood  functions  for  each  group. 
Moreover,  where  only  two  groups  are  in  question,  this 
procedure  is  further  simplified  to  a  binary  likelihood  ratio 
test. 

The  problem  of  multiple  discriminant  analysis  was 
originally  formulated  by  Fisher  (1936),  and  has  since  been 
considered  in  some  detail  by  Anderson  (1958),  Anderson  and 
Badahur  (1962),  Whalen  (1971),  and  Rao  (1973).  The 
treatment  given  here  is  derived  principally  from  the  work  of 


these  latter  authors. 


Classification  Into  One  of  Several  Populations 

Let  x  represent  a  p-component  random  vector  of 
observations  originating  from  one  of  q  populations, 
A^,...,A^.  To  classify  a  specific  realization  of  x,  say  xq  , 
into  one  of  these  populations,  the  following  q  hypotheses 
are  formed: 

Xq  originates  from  A^, 

x  originates  from  A  . 
o  q 

The  problem  now  becomes  one  of  testing  the  hypotheses, 
,  and  identifying  xq  with  the  population  A^  with  the 

largest  probability  that  is  true.  This  is  done  according 
to  some  criterion  which  weighs,  either  directly  or 

indirectly,  the  effects  of  incorrectly  classifying  the  given 
observation. 

The  probability  that  the  hypothesis  is  true  for  a 

specific  realization  xq ,  of  the  random  vector  x,  may  be 

found  from  Bayes'  Rule.  Let  P  represent  the  event  that 
is  true,  and  let  Q  represent  the  event  that  x  -  xq .  Then 
the  marginal,  conditional,  and  joint  probabilities  of  P  (x 
originates  from  H^)  and  Q  (x  =  xq)  are  given  by: 


Pr{P> 


Pr{Hi> 


marginal  probability  that  the 


hypothesis  H  is  true, 

Pr{Q>  *  Pr{x-x  >  -  marginal  probability  that  x  - 

V 

Pr{P|Q>  -  Pr{Hi|x-xo>  *  probability  that  is 

true  given  that  x  «*  x  , 

o 

Pr<Q|P>  ■  Pr<x-xo|Hi>  »  probability  that  x  «  xq  , 
given  that  is  true, 

Pr { P , Q >  -  Pr { Q , P }  -  Pr{H  ,x-x  >  -  Pr { x-x  ,H  >  - 

x  o  o  l 

joint  probability  of  x  -  xq  and  . 


Since  the  joint  probability  of  P  and  Q  is  given  by: 

Pa  (*•  o!  *  ** /?/<*} 

.  (in  -  d 

it  is  possible  to  obtain  the  following  expression  for 
Pr { P | Q } : 


SPGzrr* 
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Pa  (  P/  Q  j  * 


Pa  /$  Ip]  ?*{?>} 

P-fQj 


(III  -  2) 
(Bayes'  Rule) 


Pr{Q|P>  and  Pr{Q>  may  be  rewritten  in  terms  of  the  marginal 

and  conditional  probability  density  functions  of  Q. 

Recalling  that  Pr(Q|P}  is  the  conditional  probability  that  x 

-  x  ,  given  that  H.  is  true,  and  that  Pr{Q>  is  the  marginal 
o  i 

probability  that  x  -  xq,  the  probability  that  x  is  located 

in  a  p-dimensional  volume  dV  -  dx, , . . . , dx  centered  about  x 
r  I  p  o 

is  approximated  by: 


7/t  {  Xml  4  Xi  4  ^4*  4  Xp  S  Xmjt  J 

s  pt *•)  <*vt 


(III  -  3) 


where  p(xQ)  Is  the  marginal  probability  density  function  of 
x  evaluated  at  x  =  xq.  Similarly,  the  conditional 
probability  tnat  x  is  located  in  the  volume  dV,  given  that 
is  true,  can  be  approximated  by: 


([  X.,  4  X.  4  X.,  tetx. ,  Y.A  i  X/t  4  x ] j  #'! 


£  pi  (X.)  at  V 


(III  -  4) 


where  p . (x  )  »  p.(x  |H.)  is  the  conditional  probability 

10  1  O  1 

density  function  of  x,  given  that  is  true,  evaluated  at  x 

■  x  •  For  sc  continuous  in  dV,  as  dV  becomes  small,  the 
o 

Also,  because  p(x  )  and 

o 


approximations  above  become  exact* 
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p,(x  )  are  constant  for  a  given  value  of  x  ,  the 
1  o  o 

probabilities  in  (III-3)  and  (III-4)  approach  zero  with  dV. 
However,  in  the  ratio  given  by  Bayes'  Rule,  dV  is  present  in 
both  the  numerator  and  denominator,  so  that  it  cancels. 
Where  this  is  the  case,  the  probability  that  is  true, 
given  a  realization,  xq,  of  the  random  vector  x,  may  be 
written: 


t  .  ,  +  -  )  ?;(*•) 

TUlUif 1  - ^755 -  (in  -  5) 

In  many  cases  it  is  sufficient  to  specify  that  IT  be 
classified  into  the  population  which  maximizes  Pr { H  |x»xo > . 
This  is  actually  a  special  case  of  the  Bayes'  criterion, 
where  the  assigned  costs  of  misclass if ying  an  observation 
are  equal,  and  the  costs  of  correctly  classifying  the 
observation  are  zero.  On  the  other  hand,  it  often  happens 
that  this  rule  cannot  be  used.  This  is  true,  for  example, 
in  cases  where  the  prior  probabilities  of  the  hypotheses 
(PrtH^})  are  unknown,  or  where  the  experimenter  wishes  to 
weight  the  relative  importance  of  various  kinds  of 
misclassif ication.  It  is  desireable,  then,  to  have 
classification  criteria  available  which  allow  for  both  of 
these  instances. 

Three  general  criteria  are  presented  in  Appendix  A. 
The  Bayes'  criterion  is  the  most  powerful,  in  that  it  allows 
for  specification  of  weights  (costs)  on  the  various  types  of 
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misclassif ication.  However,  the  Bayes'  criterion  requires 
that  the  exact  prior  probabilities  for  the  hypotheses  be 
specified.  The  ninlmax  criterion  allows  for  the 
specification  of  costs,  but  does  not  require  the 
specification  of  prior  probabilities.  The  Neymann-Pear son 
criterion  specifies  the  acceptable  threshold  probability  for 
misclass if ication  of  Tc  into  a  particular  population,  and 
requires  neither  costs  nor  explicit  specification  of  prior 
probabilities . 


Classification  Into  One  of  Two  Multivariate 
Normal  Populations 

The  problem  considered  in  this  section  is  that  of 

classifying  a  seismic  event,  based  upon  multiple 

measurements  at  each  station,  averaged  over  the  total  number 

of  stations.  This  is  the  problem  originally  considered  by 

Booker  and  Mitronovas  (1964).  Specifically,  the  two 

populations  consist  of  earthquakes  and  underground  nuclear 

explosions.  The  measurement  random  variables  are  the  energy 

ratios  of  Booker  and  Mitronovas  (1964),  augmented  with 

estimated  M  /m,  ratios, 
s  D 

If  1c  represents  the  random  vector  of  these 
measurements,  x  is  assumed  to  have  the  multivariate  normal 
distribution  N(m^,C1)  if  x  originates  in  the  earthquake 
population,  and  N(m2»C2)  if  x  originates  in  the  underground 
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explosion  population.  Here  m^,  C^,  and  m^,  are  the  mean 
vectors  and  covariance  matrices  of  the  earthquake  and 
explosion  populations,  respectively.  These  distributions 
are  given  in  terms  of  the  likelihood  functions  p^(x)  and 
p2(x): 


’  uv*  rc./*  e*/>  j 


(III  -  6) 


and 


/*'*J  '  nv)*ic*iK  f-f  C*fx-£k)J 


(III  -  7) 


Defining  the  hypotheses  and  H2  for 

realization  x  »  x  : 

o 

H^:  xq  originates  in  A  ^  , 


and 


the  specific 


H2:  xq  originates  in  A2, 

it  is  possible  to  construct  the  likelihood  ratio  test  for 


Choose  Hj  if  P  L  (  )  /  p  2  (  xq  )  >.  A  T  , 

and  choose  otherwise, 

where  A  j  is  the  threshold  value  determined  from  the 
classification  criterion  (see  Appendix  A). 


— f-ifrlfrnd 
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Substituting  the  normal  densities  given  for  p^(x  )  and 
in  and  (III-7),  the  likelihood  ratio  becomes: 


j  £  \ 

e*r  /-  i  [($•  -  )  'c  "r 

-  (  Y.~  ^1, ;  Ct  fx.-J*,)]}. 


(ill  -  a) 


Because  the  logarithm  is  a  mono tonically  increasing 
function,  the  likelihood  ratio  test  for  may  be  rewritten 
in  terms  of  the  log-likelihood  ratio  test  without  changing 
the  direction  of  the  inequality: 


Choose  H  if 


[(  j(.-  m, )  C,  If.-/*')  •  ( 


■  i  U, 


(in  -  y) 


If  the  covariance  matrices,  and  C are  equal,  the 
expression  to  the  left  of  the  inequality  is  linear  in  x. 
However,  if  f  C^,  the  expression  on  the  left  is  quadratic 
in  x.  In  their  analysis,  Booker  and  Mitronovas  (1964)  made 
the  assumption  that  the  covariance  matrices  are  equal, 
thereby  linearizing  the  likelihood  ratio.  Where  this  is  the 
case,  the  likelihood  ratio  test  becomes: 
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Choose  Hj  if 


.•»  ' 

X. 


C  ( -  ^*Sr,  )  t 


i  -  )  C  f  ,  *  j£,  j 


i  /oj  Ay 


(III  -  10) 


The  log-likelihood  ratio  on  the  left  of  (111-10)  is  known  as 
Fisher's  (1936)  discriminant  function.  By  letting 


(III  -  11) 


and 


k  »  (n*t  )  C  ( \)  > 


(III  -  12) 


the  Fisher  discriminant  function  may  be  written: 


LUX.)  •  ?.'? 


(III  -  13) 


In  this  final  form,  the  linear  log-likelihood  ratio 
test  for  is  given  by: 

Choose  if  U(x)  log(  A  ) , 
choose  H2  otherwise. 

For  x  -  x^, ,  where  x^,  is  chosen  so  that 


(Ill  -  14) 


U(yT)  z  *'  £  ♦  (c  =  Ar, 


the  function  U(x^,)  describes  the  surface  which  optimally 
divides  the  parameter  space  of  x  into  the  two  classification 
regions  R^  and  R£*  Because  U(x)  is  linear  in  x,  U(xT> 
defines  a  hyperplane  in  the  parameter  space  R  »  (R^U  R^)* 
This  situation  is  illustrated  schematically  for  the 
univariate  case  in  Figure  (III-l). 

On  the  other  hand,  if  the  two  population  covariance 
matrices  are  not  equal,  the  surface  dividing  R  is  quadratic 
in  x : 

U  /  *rJ  *  C?  -  c;'j  Yt  *  Yr'  f  -  C*  -*"*  ) 

*  ~S(  »  (-»  ■**»  -  '  •*.  C,  / 


— 


foQ  Jr 


'’3  IT.t 


(III  -  15) 


In  the  two-dimensional  case,  U(x)  describes  a  quadratic 
curve,  either  an  ellipse  or  a  pair  of  hyperbolas,  where  the 
decision  regions  R^  and  are  either  the  areas  interior  and 
exterior  to  the  ellipse,  or  the  areas  bounded  by  the  pair  of 
hyperbolas  (Anderson  and  Badahur,  1962).  A  likelihood  ratio 
test  based  on  a  quadratic  log-likelihood  ratio  relies  quite 
heavily  on  the  assumption  of  normality,  because  the 
classification  is  determined  to  a  great  extent  by  those 
values  of  the  likelihood  functions  located  at  some  distance 


from  the  mean  where  the  probability  densities  are  small. 


This  situation  is  illustrated  for  the  univariate  case  in 


Figure  (III-2). 

The  case  where  the  population  covariance  matrices  are 
unequal  and  the  assumption  of  normality  is  valid  for  regions 
close  to  the  mean,  but  not  necessarily  for  regions  at  any 
great  distance  from  the  mean,  has  been  considered  by 
Anderson  and  Badahur  (1962).  Their  analysis  consists  of 
finding  the  linear  function  (hyperplane)  which  minimizes  a 
function  of  the  error  probabilities  that  corresponds  to 

either  the  Neymann-Pearson ,  Minimax,  or  Bayes'  criterion. 

q 

If  the  linear  function  is  given  by: 


H  (X)  -  H  )r  f 


(Ill  -  16) 


the  vector  r  is  found  to  be  the  product  of  the  inverse  of  a 
weighted  sum  of  the  covariance  matrices  and  the  difference 
between  the  population  means: 

--  (  t>C,  *iiCx)  ,  (III  -  17) 


Anderson  and  Badahur  (1962)  suggest  solving  for  r  by 
iteratively  substituting  for  t^  and  t ^  until  an  auxiliary 
condition  (determined  by  the  function  of  the  error 
probabilities  used  as  an  optimality  criterion)  is  satisfied. 
This  technique  has  the  advantage  of  being  more  robust  to 
deviations  from  the  normal  assumption  than  the  quadratic 
procedure  discussed  previously;  however,  it  is  far  less 
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efficient  computationally  than  the  techniques  described  for 
the  linear  and  quadratic  procedures  above. 


Classification  based  on  Samples  taken  from 
the  Respective  Populations 


In  many  problems,  including  that  of  classifying 
earthquakes  and  underground  explosions,  the  parameters  (i.e. 
the  mean  vectors  and  covariance  matrices)  of  the  respective 
populations  are  not  known  a  priori,  and  must  be  estimated 
from  a  "training  set"  of  data.  Letting 

represent  a  set  of  observations  from  A,  and  x^ . ,x^„_ 

1  l  N  / 

represent  a  similar  set  from  ,  the  problem  becomes  one  of 
obtaining  the  discriminant  function  U(3c)  so  that  a  new 
observation  (xq)  may  be  classified  into  A^  or  A^. 

In  the  linear  problem  where  the  population  covariance 
matrices  are  assumed  to  be  equal,  the  sample  mean  vectors 
for  the  two  populations  are  given  by: 


it  * 


M 


v. 

> 

7" 


X. 


(III  -  18) 


The  estimate  for  the  common  covariance  matrix  is  given  by 
the  within  groups  sample  covariance  matrix: 


(  k,  *  -  a  )  S 


(in  -  x 9 ) 


(*/-  x,)  (  */ -  Xi)'J 


Substitution  of  x^,  x^,  and  S  into  the  log  likelihood  ratio 
in  (III— 10)  gives  the  Fisher  (1936)  discriminant  function 
for  the  case  where  the  parameters  are  estimated: 


Urx>  --  *V?  -  %(  x,  ♦  f, )  ?,- *<)  (III  _  2o) 


By  applying  the  discriminant  function  given  ion  (111-20) 
to  the  training  set,  it  is  possible  to  evaluate  how  well  the 
discriminant  analysis  scheme  separates  the  sample 
observation  vectors.  Estimates  of  the  overall  average 
probability  of  correct  classification  may  be  obtained  by 
evaluating  PrOl^lx5*^}  for  each  observation  In  the  training 
set.  In  terms  of  the  likelihood  functions  p^(x)  and  p2(x), 
the  probability  that  is  true  given  that  x  =  xo  defined 

in  (I II-5 )  as : 

,  „  ,  Pi(  *•) 

PstfW/f'*'}  *  - '  (III  -  21) 


where  p(x) 
by: 


is  the  marginal  probability  density 


f>  ft  )  «  £  X  )  P/*  /  / 


of  x,  given 


(III  -  22) 


For  a  specified  set  of  prior  probabilities,  (III-21) 
may  be  evaluated  for  each  observation  vector  in  the  training 
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set  to  give  the  classification  probabilities  for  each  event* 
The  average  probability  of  correct  classification  for  the 
training  set  may  then  be  found  by  averaging  the 
classification  probabilities  for  each  event  over  the  two 
populations.  Inasmuch  as  the  parent  population  for  each 
event  in  the  training  set  is  known,  if  xq  denotes  a  sample 
vector  from  A^,  then  PrOl^lx^xj^}  is  the  probability  of 
correctly  classifying  xq  and  PrOljlS-^}  is  the  probability 
of  misclassif ying  xq.  The  overall  probability  of  correct 
classification  in  terms  of  the  samples  in  the  training  set 
is  given  by: 


i.i  a *j i 
r 


•  Z' 

Ni  A  f  Hi!  **  Xj  I . 

iu 


(III  -  23) 


In  a  like  fashion,  the  overall  average  probability  of 
mi sclas s i f i c a t ion  is  given  by: 


h  - 


r, 


**  l*/f 


(Ill  -  24) 


In  addition  to  the  determination  of  the  classification 
probabilities  P^  and  P^,  a  discriminant  analysis  of  the 
training  set  may  provide  an  insight  into  which  subset  of 
variables  best  contributes  to  the  separation  of  the  two 
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populations.  The  Mahalanobis  D  statistic  may  be  used  as  a 
measure  of  the  separation  of  the  two  populations  (Anderson, 
1958).  The  Mahalanobis  statistic  is  defined  (Rao,  1973): 

*  '  t  t.  -  *« )  Z  )  ( 1 1 1  -  2  5  ) 

This  statistic  may  be  normalized  and  used  as  an  F-statistic 

to  test  the  hypothesis  that  the  two  population  means  are 
2 

equal.  The  D  statistic  (or  its  associated  F-statistic) 
may  be  calculated  for  any  single  variable  in  the 
discriminant  analysis,  or  for  any  group  of  variables.  This 
provides  a  tool  for  determining  which  of  the  measured 
variables  contributes  the  greatest  amount  of  information  to 
the  discriminant  analysis  procedure. 


Cluster  Analys is  Techniques 

In  addition  to  the  multiple  discriminant  analysis 
theory  presented  above,  several  cluster  analysis  techniques 
are  applied  to  the  Booker  and  Mitronovas  (1964)  energy 
ratios  and  the  estimated  M^m^  ratios.  Cluster  analysis 
appears  to  be  an  attractive  means  for  grouping  data  because 
no  external  probabilistic  model  need  be  assumed.  Instead, 
cluster  groupings  are  formed  according  to  the  optimization 
of  a  specified  criterion.  For  example,  in  this  study  the 
criterion  used  to  form  clusters  is  the  minimization  of  the 


<•  ■  St 
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total  error  sum  of  squares. 

A  cluster  may  be  viewed  as  a  collection,  or  set  of 
events,  where  the  total  error  sum  of  squares  is  minimized. 
In  equivalent  terms,  the  Euclidean  distance  from  each  event 
to  the  centroid  of  the  parent  cluster  is  less  than  the 


distance  to  the  centroid  of  any  other  cluster.  In  the  case 

of  the  Booker  and  Mitronovas  data  set,  each  event  is 

considered  as  a  point  in  a  seven-dimensional  space  (or  an 

eight-dimensional  space  where  estimated  M  /m,  ratios  are 

s  b 

included).  The  similarity  measure  used  to  fuse  these  events 
into  clusters  is  the  squared  Euclidean  distance.  The 
distance  between  the  ith  and  jth  event  in  a  p-dimensional 
space  is  defined  as: 


<y,v  *  1  (*>k  - 

7 


(III  -  26) 


The  clustering  techniques  used  in  this  study  consist  of 
Ward's  hierarchical  clustering  procedure  and  an  iterative 
relocation  procedure,  both  of  which  are  described  by  Everitt 
(1974)  and  Anderberg  (1973). 

The  strategy  used  in  Ward's  procedure  involves  first 
constructing  a  similarity  matrix  containing  the  squared 
Euclidean  distance  between  each  pair  of  events  in  the  input 
data  set.  In  the  beginning,  each  event  is  viewed  as  a  single 
cluster.  Events  are  fused  in  such  a  way  that  the  smallest 
increase  in  the  error  sum  of  squares  occurs.  The  error  sum 


of  squares  is  defined  as  the  sum  of  the  distances  from  each 


1 
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event  to  the  centroid  of  its  parent  cluster  (Wishart,  1975), 
where  the  cluster  centroid  is  simply  the  cluster's  center  of 
mass.  At  the  first  step,  with  each  event  viewed  as  a  single 
cluster,  the  total  error  sum  of  squares  is  zero.  The  two 
closest  events  are  fused  to  form  a  single  cluster.  This 
process  proceeds,  either  by  fusing  pairs  of  events,  by 
adding  events  to  existing  clusters  of  events,  or  by  fusing 
clusters  until  only  one  large  cluster  remains. 

Ward's  method  has  the  disadvantage  that  cluster 
groupings  obtained  at  an  early  level  in  the  procedure  remain 
fixed  throughout  the  clustering  process.  That  is,  if  two 
events  are  fused  early  in  the  clustering  procedure,  they 
remain  as  members  of  the  same  cluster  throughout,  even 
though  at  some  later  point  a  decrease  in  the  total  error  sum 
of  squares  might  be  realized  by  assigning  the  events  to 
different  clusters.  This  problem  can  be  avoided  by  using  an 
iterative  relocation  procedure,  such  as  the  one  described  by 
Wishart  (  1975  )  . 

In  Wishart's  RELOCATE  procedure,  at  any  given  step, 
each  object  is  considered  in  turn,  and  removed  from  its 
parent  cluster.  The  distance  from  the  removed  event  to  the 
centroid  of  each  cluster  is  computed,  and  the  event  is 
returned  to  that  cluster  for  which  this  distance  is 
minimized.  After  the  relocation  cycle  has  been  completed 
(that  is,  when  no  more  events  can  be  moved),  the  two  most 
similar  clusters  are  fused.  These  relocation  and  fusion 
procedures  may  then  be  alternated  until  the  desired  number 


of  clusters  is  obtained 


The  clustering  procedures  described  in  this  section  and 
the  multiple  discriminant  analysis  procedures  described  in 
the  previous  section  were  both  used  to  classify  a  set  of 
earthquakes  and  underground  explosions  originally  studied  by 
Booker  and  Mitronovas  (1964).  The  next  chapter  contains  a 
description  of  the  Booker  and  Mitronovas  data  set,  the 
programs  used  in  the  classification  schemes,  and  the  results 
obtained  by  applying  these  programs  to  the  Booker  and 
Mitronovas  data  set. 
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CHAPTER  IV 


APPLICATION  OF  CLUSTERING  AND  DISCRIMINANT  ANALYSIS 
TECHNIQUES  TO  A  SET  OF  EARTHQUAKES  AND  UNDERGROUND 

EXPLOSIONS 


Description  of  the  Data 

The  data  analyzed  using  the  discriminant  analysis  and 

clustering  techniques  described  earlier  consist  of  a  set  of 

20  earthquakes  and  27  Nevada  Test  Site  (NTS)  underground 

nuclear  explosions.  Summary  information  for  these  events  is 

provided  in  Table  (IV-1).  With  the  exception  of  three 

explosion  collapses  which  have  been  deleted  in  the  present 

study,  this  is  the  same  set  of  events  studied  by  Booker  and 

Mitronovas  (1964),  and  by  Mitronovas  (1963). 

The  measurement  variables  on  which  the  discriminant  and 

cluster  analyses  are  based  are  the  set  of  seven  energy 

ratios  published  by  Booker  and  Mitronovas  (1964).  In 

addition,  for  events  where  the  body  wave  magnitudes  have 

been  calculated,  the  original  set  of  variables  is  augmented 

with  estimated  M  /m,  ratios,  where  the  M  estimates  were 

so  s 

randomly  generated  about  the  M  / m,  lines.  Measured  M 

so  s 

values  could  not  be  used  because  the  long-period  data  for 
these  events  are  either  nonexistent  or  of  such  poor  quality 
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Chat  reliable  M  estimates  could  not  be  obtained.  Instead, 
s 

the  Mg  estimates  were  generated  by  drawing  upon  recent 

existing  results  relating  M  to  m.  for  NTS  explosions  and 

earthquakes.  The  Mg/m^  ratios  used  were  calculated  by 

generating  values  for  Mg ,  normally  distributed  with  a 

specified  variance,  about  the  expected  point  on  the 

appropriate  Ms/mb  line.  The  Mg/m^  lines  used  in  this  study 

are  those  given  by  Alexander  and  Lambert  (1973),  typical  of 

underground  explosions  and  small-magnitude  earthquakes  in 

the  Western  United  States.  Four  M  /m^  ratios  were 

s  b 

calculated  for  each  event  using  published  body  wave 

magnitudes.  In  each  case  the  variance  used  in  generating 

the  earthquake  Mg  estimates  is  twice  that  used  in  generating 

the  Mg  estimates  for  the  underground  nuclear  explosions. 

This  greater  uncertainty  is  incorporated  to  reflect  the 

assumed  complexity  and  variability  of  the  earthquake  source 

mechanism,  as  compared  to  that  of  the  underground  explosion 

source  mechanism.  The  explosion  Mg  variance  estimates  were 

chosen  as  multiples  of  the  M  variance  given  by  von  Seggern 

s 

(1972).  Table  (IV-2)  lists  the  energy  ratios  used  in  this 

study.  The  m,  values,  the  M  estimates  and  the  M  /m,  ratios 

b  s  s  b 

are  given  in  Table  (IV-3).  Mg  versus  m^  is  plotted  for  each 
variance  level  in  Figures  (IV-1)  through  (IV-4). 

The  energy  ratios  for  each  station  were  originally 
obtained  by  Booker  and  Mitronovas  by  calculating  the 
energies  within  time  intervals  corresponding  to  specified 
velocity  windows  for  each  event,  and  forming  ratios  for 


those  window  combinations  which  appeared  to  give  promising 


discrimination 

criteria . 

The 

total 

energy  within 

each 

window  for  a 

given  event 

was 

found 

by  summing  over 

the 

square  of  the  vertical,  radial,  and  transverse  components  at 
each  station,  then  averaging  over  the  stations.  These 
values  were  corrected  for  microseismic  noise  by  subtracting 
the  average  noise  energy  based  on  samples  taken  in  a  window 
immediately  preceding  the  first  arrival  for  each  event. 

The  above  calculations  were  performed  on  302 
seismograms  representing  the  50  events.  Each  seismogram  was 
recorded  on  Long  Range  Seismic  Monitoring  (LRSM)  network 
short-period  Benioff  seismometers.  Because  normally  the 
LRSM  horizontal  radial  component  is  oriented  toward  NTS,  the 
horizontal  components  were  appropriately  rotated  prior  to 
evaluating  the  energy  ratios  for  each  earthquake.  The 
velocity  windows  used  in  forming  the  energy  ratios  are 
listed  in  Table  (IV-4). 

Discriminant  Analvs es 

The  discriminant  analysis  in  this  paper  consists  of  two 
parts.  The  first  part  is  simply  an  attempt  to  duplicate  the 
results  obtained  by  Booker  and  Mitronovas  (1964)  using  the 
1975  revised  version  of  BMD04M,  a  two  group  linear  multiple 


discriminant  analysis  program  supplied  in  the  Biomedical 
Computer  Programs  Package  (BMD),  and  published  by  the  UCLA 
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School  of  Medicine.  These  results  are  compared  with  those 

obtained  using  a  more  recent  linear  multiple  discriminant 

analysis  program,  BMD07M,  supplied  in  the  same  package,  and 

with  the  results  obtained  using  the  quadratic  discriminant 

analysis  program  described  in  Appendix  B. 

The  second  part  consists  of  a  discriminant  analysis  of 

the  subset  of  the  Booker  and  Mitronovas  events  which  has 

been  augmented  with  the  estimated  M  /m,  ratios  described 

s  b 

above.  The  M  /m,  discriminant  is  one  of  the  most  successful 
s  b 

discriminants  yet  found,  so  that  its  introduction  should 
greatly  enhance  the  performance  of  the  original  Booker  and 
Mitronovas  discriminant  analysis  scheme. 


Program  Descriptions 

The  BMD04M  program  calculates  the  linear  term  of  the 
Fisher  (1936)  discriminant  function  for  each  event.  If  the 
ith  event  is  denoted  by  x^,  this  linear  term  is  given  by: 

Utii)  '  Hi'*  (  *.-*«),  (IV  -  1) 

where  x^  and  x^  denote  the  respective  sample  means  vectors 
of  the  earthquake  and  explosion  populations,  and  S  is  the 
within-group  scatter  matrix,  given  by: 
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(IV 

-  2) 

Ml  1 

The  values  of 

U(xA) 

are 

then 

listed 

in  order 

of 

decreasing  algebraic 

magnitude. 

Because  the 

constant 

term 

of  the  discriminant 

f unc t ion  is 

common  to  all 

events , 

and 

does  not  affect  the 

ordering , 

it  is 

not  included  in 

the 

calculation  of  the  UCx^). 

In  addition  to  computing  and  rank  ordering  the  U(xi>, 

2 

BMD04M  computes  the  Mahalanobis  D  statistic,  given  by: 


y  «  (  X,  -  Xt)'  s" (X,  -  *,  )  , 


(IV  -  3) 


and  the  associated  F-statistic  for  testing  the  differences 
in  group  means: 


f  f  >  Nt  +  Nx-p~i) 


A/,  A/t  (AJ,  *A/t  ~fi~/ ) 

-  D\ 

f>  fM,  *  A/i)  (ti.  +  A/t  -2) 
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Program  BMD07M  also  computes  the  linear  discriminant 
function.  However,  BMD07M  is  more  flexible  than  BMD04M  in 
that  provision  is  made  for  the  introduction  of  prior 
probabilities,  whereas  BMD04M  assumes  that  all  prior 
probabilities  are  equal.  BMD07M  is  able  to  classify  into 
more  than  two  groups,  and  calculates  posterior 
classification  probabilities  for  each  event.  However,  the 
most  useful  char ac t er is r t ic  of  this  program  is  the  way  in 
which  variables  are  entered  into  the  discrimination  process. 


BMD04M  calculates  initial  F-statistics  for  each  variable 
alone,  then  constructs  classification  functions  using  the 
entire  specified  set  of  variables.  BMD07M,  on  the  other 
hand,  computes  the  initial  F-statistics  for  each  variable, 
then  enters  variables  in  a  stepwise  fashion,  where  the 
F-to-enter  and  F-to-remove  statistics  are  updated  at  each 
step.  This  feature  allows  for  a  comparison  of  the  relative 
discriminating  power  among  variables,  as  well  as  for  each 
variable  taken  alone.  For  example,  two  variables  may  have 
initial  F-statistics  of  comparable  magnitude.  Upon  entering 
the  variable  with  the  larger  initial  F  value,  the  F-to-enter 
for  the  second  variable  may  decrease  significantly.  This 
indicates  that  the  two  variables  are  highly  correlated,  so 
that  the  second  variable  contributes  only  a  marginal  amount 
of  additional  information  to  the  discrimination  process. 
Because  this  facility  allows  the  program  to  be  used  to 
obtain  maximally  independent  sets  of  discriminant  variables, 
BMD07M  should  prove  to  be  particularly  valuable  in  studying 
the  earthquake/underground  explosion  discrimination  problem. 

Quadratic  discriminant  procedures  provide  a  means  for 
discriminating  between  groups  where  the  events  are  normally 
distributed,  but  where  the  covariance  matrices  differ 
markedly.  Such  procedures  sacrifice  some  of  the  robustness 
of  the  linear  method  to  deviations  from  the  normal 
hypothesis,  but  they  may  provide  better  classifications  than 
the  linear  procedures  where  the  normal  hypothesis  is 
applicable  and  the  covariance  matrices  for  the  two  groups 
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differ  significantly.  A  description  of  the  quadratic 
discriminant  analysis  program  used  in  this  study  is  provided 
in  Appendix  B  . 


Discriminant  Analysis  Results  for  the  Original 
Booker  and  Mitronovas  Data  Set 

The  classification  obtained  using  BMD04M  is  very 
similar  to  the  one  obtained  by  Booker  and  Mitronovas  (1964), 
except  that  several  adjacent  events  have  been  interchanged 
in  the  rank  ordering.  The  misclassif ication  of  the  Boxelder 
Creek  earthquake  occurs  in  the  same  fashion  here  as  in  the 
previous  studies.  The  classification  obtained  using  BMD07M 
where  equal  prior  probabilities  are  assumed  is  identical  to 
that  obtained  using  BMD04M.  This  assumption  of  equal  prior 
probabilities  is  made  for  all  the  discriminant  analyses 
presented  here.  The  ratios,  the  order  in  which  they  are 
entered,  the  initial  F- s t a t i s t ic s  ,  and  the  F-to-enter  and 
F-to-remove  values  at  each  step  are  given  in  Table  (IV-5). 

Before  any  of  the  variables  are  entered,  the  value  of 

2 

the  F-statistic  based  on  the  Mahalanobis  D  statistic  is 
calculated  for  each  variable.  This  is  a  measure  of  the 
separation  in  group  means  achieved  by  each  variable 
individually.  As  shown  in  Table  (IV-5),  variable  4  (Pg/Lg^) 
is  the  most  powerful  discriminant  among  the  Booker  and 
Mitronovas  energy  ratios.  Next  is  variable  3  (Pg/B),  then 
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variable  1  (P^/S^),  variable  5  ( P g / [ Lg ^+Rg ^ ]  ) ,  and  variable 
2  (P2/S2).  Variables  6  (Rg^/Lg^  and  7  (Rg2/Lg2),  where 
taken  alone,  do  not  appear  to  be  promising  discriminants. 

The  first  variable  to  be  entered  is  variable  4.  Of  the 
remaining  6  variables,  variables  2  and  5  provide  the 
greatest  additional  discriminating  power.  Upon  entering 
variable  2  at  the  second  step,  variables  1  and  3  become  the 
most  powerful  of  those  yet  to  be  entered.  This  indicates  a 
strong  correlation  between  variables  2  and  5,  so  that 
variable  5  contributes  only  marginally  to  the  discriminant 
process  after  variable  2  has  been  entered.  The  process  of 
entering  the  variables  continues  until  either  all  variables 
have  been  entered,  or  until  none  of  the  remaining  variables 
can  meet  the  F-to-enter  threshold.  The  F-to-enter  threshold 
is  set  by  default  to  a  small  nominal  value  (0.001). 
However,  this  value  may  be  adjusted  at  the  user's  option. 

Using  only  the  original  Booker  and  Mitronovas  energy 
ratios,  the  probability  of  correctly  classifying  an 
earthquake  is  0.821.  That  of  correctly  classifying  an 
explosion  is  0.771.  The  overall  average  probability  of 
correct  classification  is  0.792.  The  two  events  that  are 
misclass if ied  are  the  Boxelder  Creek  earthquake,  identified 
as  an  explosion  with  probability  0.67,  and  the  Codsaw 
explosion,  marginally  classified  as  an  earthquake  with 
probability  0.58. 

BMD07M  also  computes  the  canonical  transformation  for 
the  set  of  events  under  consideration.  The  canonical 
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variables  are  determined  from  a  linear  transformation  of  the 
original  set  of  variables  which  maximizes  the  b e twee n-gr oup 
scatter  with  respect  to  the  within-group  scatter  for  a  given 
partitioning  of  the  data  set.  Separation  of  the  group  means 
are  reflected  in  the  first  canonical  variable  in  the 
two-group  case,  where  the  remaining  variables  reflect 
within-group  differences.  A  scatterplot  of  the  first 
canonical  variable  versus  the  second  is  shown  for  the  entire 
set  of  27  explosions  and  20  earthquakes  in  Figure  (IV-5). 

The  final  discriminant  analysis  of  the  original  Booker 
and  Mitronovas  data  set  given  here  consists  of  a  quadratic 
discriminant  analysis  using  the  program  described  in 
Appendix  B-  The  Boxelder  Creek  earthquake  is  mi s c la ss i f ied 
under  this  quadratic  scheme  as  it  was  under  the  linear 
scheme,  but  with  probability  0.95  instead  of  0.67.  However, 
instead  of  marginally  mis c lass i f y ing  the  Codsaw  underground 
explosion  as  an  earthquake  as  happens  using  the  linear 
procedure,  the  quadratic  procedure  correctly  identifies  this 
explosion,  but  marginally  identifies  the  Cache  Creek 
earthquake  as  an  explosion  with  probability  0.53.  In  both 
the  linear  and  quadratic  cases,  however,  the  marginal 
mis c las s if tc a t ions  may  be  avoided  by.  a  judicious  choice  of 
the  prior  probabilities.  Under  the  quadratic  procedure,  the 
average  posterior  probability  of  correctly  classifying 
either  an  earthquake  or  an  underground  explosion  is  0.94. 
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Discriminant  Analysis  Results  for  the  Reduced 

Data  Set  with  Estimated  M  /m.  Ratios 

s  b 

As  mentioned  in  the  data  description  above,  a  subset  of 
the  events  used  by  Booker  and  Mitronovas  have  associated 
with  them  published  body  wave  magnitudes.  This  subset 
consists  of  the  nine  earthquakes  and  21  underground 
explosions  shown  in  Table  (IV-3).  Because  published  Mg 
values  are  not  available  for  any  of  these  events,  estimated 
Mg/mb  ratios  have  been  used,  where  the  Mg  values  are  typical 
(statistically)  of  small  magnitude  earthquakes  and 
underground  explosions  in  the  Western  United  States. 

The  first  step  in  this  classification  consists  of  a 

linear  discriminant  analysis  where  BMD07M  is  applied  to  this 

subset  of  events,  but  where  only  the  seven  Booker  and 

Mitronovas  energy  ratios  are  used.  Under  this  linear 

procedure,  where  equal  prior  probabilities  are  assumed, 

eight  of  the  earthquakes  are  correctly  classified  and  one 

earthquake  'Cache  Creek)  is  m is c la ss if ied  as  an  explosion 

with  probability  0.53.  The  Boxelder  Creek  event  is  not 

included  in  this  analysis,  because  published  m,  values  for 

t> 

this  event  could  not  be  found.  Of  the  explosions,  19  are 
correctly  classified  and  two  (Codsaw  and  Stoat)  are 
misc lass  if ied  as  earthquakes  with  probabilities  0.65  and 
0.85  respectively.  The  average  posterior  probability  of 
correctly  classifying  an  earthquake  is  0.81,  and  that  of 
correctly  classifying  an  underground  explosion  is  0.77.  The 
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overall  average  probability  of  correct  classification  is 
0.78.  A  summary  of  the  stepwise  discrimination  process  for 
this  set  of  events  is  given  in  Table  (IV-6),  and  a 
scatterplot  of  the  first  two  canonical  variables  is  given  in 
Figure  (IV-6). 

The  initial  F-statistics  for  this  reduced  set  of  events 
is  comparable  to  that  found  for  the  entire  set  of  20 
earthquakes  and  27  explosions.  Taken  alone,  variable  4 
(Pg/Lg^)  gives  the  greatest  separation  of  the  group  means. 
Variables  3  (Pg/B),  1  (P^/S^)  and  5  (Pg/ [Lg^+Rg^] )  are  next 
and  exhibit  comparable  discriminating  potential.  Variables 
6  (Rgj/Lgj)  and  7  (Rg^/Lg^  exhibit  an  insignificant 
separation  of  group  means. 

The  quadratic  discriminant  analysis  procedure,  when 
applied  to  this  subset  of  events,  yields  excellent  results 
with  no  events  raise  lass  if ied .  The  average  posterior 
probability  of  correctly  classifying  an  earthquake  is  0.95, 
and  that  of  correctly  classifying  an  explosion  is  0.99, 
which  yields  an  overall  average  probability  of  correct 
classification  of  0.98. 

It  should  be  noted  however,  in  the  case  of  the 
quadratic  discriminant  procedure,  that  the  results  obtained 
for  this  reduced  set  of  events  might  be  somewhat  misleading. 
The  reason  for  this  is  that  the  covariance  matrices  for  a 
given  population  are  calculated  only  from  the  events  in  that 
population.  Because  the  earthquake  population  in  the 
reduced  set  consists  of  only  nine  events,  with  measurements 
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on  seven  variables  (or  eight  variables  where  the  Ms/m^  ratio 
is  included),  a  reliable  estimate  for  the  population 
covariance  matrix  may  not  have  been  obtained.  This  is  not 
necessarily  the  case  for  the  linear  procedure,  where  the 
single  common  covariance  matrix  is  calculated  as  a  weighted 
average  of  the  individual  population  covariance  matrices, 
where  the  weights  are  determined  from  the  number  of  events 
in  each  group . 

The  effect  of  adding  the  Ms/mb  ratios  to  the  original 

set  of  energy  ratios  is  quite  dramatic.  A  summary  of  the 

stepwise  procedure  used  in  BMD07M  is  listed  in  Tables  (IV-7) 

to  (1V-10)  for  each  of  the  four  sets  of  Mg/mb  ratios,  where 

M  /m,  is  listed  as  variable  8.  Scatterplots  of  the  first 
s  b 

two  canonical  variables  for  each  set  of  M  /m  ratios  are 

s  b 

given  in  Figures  (IV-7)  to  (IV-10). 

In  each  case  the  M  /m,  ratio  was  found  to  be  the  most 

s  b 

powerful  discriminant  of  the  eight  variables  employed  and, 

in  each  case,  was  the  first  variable  entered.  For  the  first 

two  sets,  where  the  variance  of  the  explosion  Mg  estimates 

is  equal  to  and  1.5  times  the  variance  found  by  von  Seggern 

(1972),  the  M  /m,  ratio  alone  is  sufficient  to  discriminate 
s  b 

successfully  between  the  two  groups.  For  the  third  and 

fourth  sets,  where  the  explosion  variance  is  twice  and 

three  times  times  that  given  by  von  Seggern,  only  one 

m is c la ss if ica t ion  occurs  where  the  M  / m,  ratio  is  used 

s  b 

alone.  For  the  third  set,  this  misclass if icat ion  is 


corrected  by  entering  the  remaining  seven  variables  (energy 

ratios).  However,  an  interesting  situation  occurs  for  the 

fourth  set,  as  shown  in  Table  (IV-10).  The  first  variable 

entered  is  variable  8  (Mg/m^).  At  this  point,  the  Passaic 

explosion  is  misclass if ied  as  an  earthquake.  The  next 

variable  entered  is  variable  3  (Pg/B),  and  at  this  step 

Passaic  is  correctly  classified  as  an  explosion.  Variable  2 

(P2/S2)  is  entered  next,  and  the  classification  remains 

unchanged  with  no  events  misc lass  if ied .  However,  upon 

entering  variable  5  ( Pg/ [Lg  +Rg^ ] ) ,  Passaic  is  again 

misclass if ied  as  an  earthquake  and  remains  so  throughout  the 

rest  of  the  procedure.  The  Passaic  explosion  was  not 

misclassif ied  in  any  of  the  other  discriminant  analyses,  and 

is  only  marginally  misclass if ied  here  with  probability  0.60. 

From  Tables  (IV-3)  and  (IV-10),  and  Figure  (IV-4)  it  can  be 

seen  that  the  estimated  M  /m,  ratio  for  the  Passaic 

s  b 

explosion  is  more  than  two  standard  deviations  from  the 
explosion  mean  Mg  value,  and  is  within  one  standard 
deviation  of  the  mean  Mg  value  for  earthquakes.  For  this 
reason,  misclassif ication  of  the  Passaic  explosion  may  be 
expected  where  the  Mg/m^  ratio  is  used  alone.  Incorporation 
of  the  other  energy  ratios  shifts  the  event  back  and  forth 
between  the  two  populations,  resulting  in  a  final  marginal 
mi s c las s i f i ca t ion .  Because  the  classification  probability 
for  the  Passaic  explosion  is  closer  to  0.5  than  that  of  any 
other  event  identified  as  an  earthquake,  the  Passaic 
explosion  may  be  correctly  classified  by  using  a  different 


choice  for  the  prior  probabilities. 


Where  the  quadratic  discriminant  analysis  procedure  was 

used,  no  events  were  mis c las s if ied •  For  the  energy  ratios 

augmented  with  the  first  three  sets  of  M  /m,  ratios,  the 

s  D 

average  probability  of  correct  classification  for 

earthquakes  and  explosions  is  nearly  1.0.  For  M  /m,  set  4, 

s  b 

the  probability  of  correctly  classifying  an  explosion  is 
nearly  1.0,  and  the  probability  of  correctly  classifying  an 
earthquake  is  0.99. 

A  summary  of  the  classification  probabilities  for  each 
of  the  discriminant  analyses  is  given  in  Table  (IV-11). 


Cluster  Analysis  Results 


Program  Descriptions 

The  cluster  analysis  procedures  used  in  this  paper  are 
supplied  in  the  Clustan  IC  cluster  analysis  package 
developed  at  University  College,  London  (Wishart,  1975).  As 
mentioned  in  Chapter  III,  two  clustering  techniques  are 
employed  in  this  study.  The  first  is  Ward's  hierarchical 
clustering  procedure.  This  method  involves  constructing  a 
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similarity  matrix  containing  the  squared  Euclidean  distance 
between  each  pair  of  events  in  the  input  data  set.  At  the 
start  of  the  procedure,  each  event  is  considered  as  a  single 
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cluster,  and  the  two  events  with  the  smallest  Euclidean 
distance  between  them  are  fused  to  form  a  new  cluster.  A 
new  cluster  centroid  (center  of  mass)  is  computed,  the  total 
error  sum  of  squares  is  calculated,  and  the  similarity 
matrix  is  updated  where  the  centroid  of  the  new  cluster  is 
substitued  for  the  old  events  or  cluster  centroids.  In  this 
way,  the  rank  of  the  similarity  matrix  decreases  by  one  each 
time  a  new  cluster  is  formed.  This  process  continues  until 
the  desired  number  of  clusters  has  been  obtained.  At  each 
step,  the  events,  or  clusters,  which  are  fused  are  those 
which  result  in  the  smallest  increase  in  the  total  error  sum 
o f  squares  . 

In  the  case  of  the  hierarchical  procedure  contained  in 
the  Clustan  IC  package,  the  cluster  fusions  are  effected  by 
a  combinatorial  transformation  of  the  distance  matrix. 
Assuming  that  the  clusters  P  and  Q  have  just  been  fused,  the 
increase  in  the  error  sum  of  squares  which  occurs  upon 
fusion  of  the  new  cluster  (P  +  Q)  with  any  other  cluster  R,  is 
given  by  (Wishart,  1975): 


3  ft,  P  +  Q) 


/ 

*  ^9 


IjM*  >»*)  S(t.P) 


*  *  #9  )  $  (*,9)  - 


(IV  -  5) 


where  ,  N^,  and  are  the  respective  sizes  of  the 

clusters  P,  Q,  and  R. 

The  hierarchical  clustering  method  has  the  disadvantage 
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Chat  cluster  groupings  obtained  at  an  early  point  in  the 
clustering  procedure  must  remain  fixed  throughout  the 
process.  In  other  words,  if  two  events  are  fused  at  some 
point  to  form  a  new  cluster,  these  two  events  must  remain 
members  of  the  same  cluster  throughout  the  clustering 
process.  This  is  the  case  even  where  a  net  decrease  in  the 
total  error  sum  of  squares  might  be  obtained  if  the  two 
events  were  assigned  to  different  clusters.  The  RELOCATE 
procedure  in  the  Clustan  IC  package  circumvents  this 
problem.  The  RELOCATE  procedure  uses  a  convergent  K-means 
method  (Everitt,  1974)  to  relocate  events  iteratively  in  new 
clusters  at  each  step  in  the  hierarchical  procedure.  At  any 
given  step,  each  event  is  removed  from  its  parent  cluster, 
and  the  Euclidean  distances  between  the  event  and  the 
centroids  of  every  cluster  are  computed.  If  the  distance 
from  the  event  to  any  other  cluster  centroid  is  less  than 
the  distance  from  the  event  to  the  centroid  of  the  parent 
cluster,  the  event  is  reassigned  to  the  new  cluster.  The 
two  closest  clusters  are  then  fused  and  the  entire  process 
is  repeated  until  the  desired  number  of  clusters  has  been 
obtained.  This  method  has  the  additional  advantage  that  a 
similarity  matrix  need  not  be  stored  in  core  or  on  disk.  On 
Che  other  hand,  this  method  is  several  times  slower  than  the 
hierarchical  procedure. 


Cluster  Analysis  Results 


Because  the  cluster  analysis  procedures  used  in  this 
study  seek  to  minimize  the  total  error  sum  of  squares, 
inclusion  of  variables  which  do  not  reflect  given  group 
differences  can  mask  the  contributions  from  those  which  do 
reflect  such  differences.  For  this  reason,  the  cluster 
analyses  described  here  were  based  on  the  three  most 
powerful  discriminant  variables  obtained  from  the 
discriminant  analyses.  In  this  way  the  tendency  for  the 
data  to  cluster  into  the  actual  earthquake  and  explosion 
populations  is  enhanced. 

Application  of  Ward's  hierarchical  method  and  the 
K-means  iterative  relocation  procedure  to  the  original  set 
of  20  earthquakes  and  27  underground  explosions  shows  that 
the  resulting  clusters  do  not  reflect  actual  group 
divisions.  A  fusion  tree  which  shows  the  cluster  fusion 
history  from  Ward's  method  is  given  in  Figure  (IV-11).  This 
analysis  was  generated  using  variables  4  (Pg/Lg^),  2 
(P2/S2),  and  3  (Pg/B). 

It  has  been  suggested  by  Wishart  (1975)  that  stable 
clusters  are  those  which  require  a  large  increase  in  the 
total  error  sum  of  squares  in  order  to  fuse  them  with  other 
clusters.  With  this  in  mind,  it  may  be  seen  from  Figure 
(IV-11)  that  the  cluster  analysis  of  the  Booker  and 
Mitronovas  data  set  is  stable  at  the  four-cluster  level.  A 


move  to  the  t h r ee -c lus t e r  level  results  in  an  increase  in 


-62- 


the  total  error  sum  of  squares  by  more  than  100  per  cent. 

Scatterplots  are  given  in  Figures  (IV-12)  and  (IV-13) 
which  show  the  cluster  distributions  at  the  four-cluster 
level.  In  both  figures,  variable  4  (Pg/Lg^)  is  plotted 
versus  variable  2  Figure  (IV-12)  was  obtained  from 

the  cluster  groupings  generated  by  Ward's  method,  and  Figure 
(IV-13)  was  obtained  from  those  generated  by  RELOCATE, 
beginning  with  a  random  partition  into  five  initial 
clusters.  The  cluster  memberships  are  identical  in  both 
cases,  although  the  indexing  for  the  clusters  varies  because 
of  the  internal  book-keeping  of  each  procedure.  The  close 
similarities  between  these  two  groupings  indicate  an  optimal 
grouping  because  each  was  obtained  in  a  different  manner- 
However,  in  both  cases,  no  single  cluster  consists 
exclusively  of  earthquakes  or  explosions. 

The  next  step  consists  of  cluster  analyses  similar  to 

those  described  above  for  the  reduced  set  of  events,  both  in 

the  case  where  only  the  seven  energy  ratios  are  used,  and 

where  the  lowest  variance  set  of  M  / m.  ratios  has  been 

s  b 

included.  Figure  (IV-14)  shows  the  fusion  tree  obtained 
using  Ward's  method  where  variables  4  (Pg/Lg^),  2  (P^/S^), 

and  7  (Rg^/Lg^)  are  used.  As  was  found  for  the  original  set 
of  events,  the  reduced  set  shows  no  tendency  to  cluster  into 
the  actual  earthquake  and  explosion  populations.  Moreover, 
the  clustering  process  again  stabilizes  at  the  four-cluster 
level,  with  a  large  increase  in  the  total  error  sum  of 
squares  required  to  move  to  the  three-cluster  level  or 
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below. 

Figures  (IV-15)  and  (IV-16)  are  the  scatterplots 
showing  the  cluster  distributions  obtained  at  the 
four-cluster  level  using  Ward's  method  and  procedure 
RELOCATE-  Again,  aside  from  indexing  differences,  the 
cluster  memberships  are  identical,  which  implies  an 
extremely  stable  cluster  grouping.  In  the  same  manner 
described  above,  the  grouping  generated  by  RELOCATE  was 
obtained  using  a  five  cluster  initial  partition. 

The  fusion  tree  obtained  where  the  smallest  variance 
set  of  Mg/m^  ratios  together  with  variables  4  (Pg/Lg^)  and  7 
(Lg^/Rgj)  have  been  included  is  given  in  Figure  (IV-17). 
With  M  /  m,  included,  the  tendency  to  cluster  into  the 

S  D 

explosion  and  earthquake  populations  is  much  greater.  Two 
earthquake  clusters  (one  containing  an  explosion)  form  at 
the  five  cluster  level.  These  two  clusters  fuse  at  the 
four-cluster  level  to  form  an  earthquake  cluster  with  one 
misplaced  explosion  (Stoat).  However,  this  earthquake 
cluster  fuses  with  the  larger  explosion  cluster  at  the 
two-cluster  level.  Two  clusters  remain  at  the  two-cluster 
level,  where  one  consists  exclusively  of  explosions  and  the 
other  cluster  contains  both  earthquakes  and  explosions. 
Scatterplots  of  H.£/m^  versus  ratio  4  (Pg/Lg^)  are  shown  in 
Figures  (IV-18)  and  (IV-19).  Aside  from  indexing 
differences,  the  cluster  memberships  are  identical  with  one 
exception.  In  the  cluster  grouping  generated  by  RELOCATE, 
the  explosion  which  is  misplaced  in  the  grouping  generated 


by  Ward's  method  is  correctly  placed  into  an  explosion 
cluster  • 
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CHAPTER  V 


CONCLUSIONS  AND  DISCUSSION 


Discriminant  Analyses 

The  discriminant  analysis  procedures  BMD04M  and  BMD07M 

successfully  duplicate  the  results  obtained  by  Booker  and 

Mitronovas  (1964).  Also  verified  in  this  study  are  the 

relative  discriminating  powers  of  each  of  the  seven  energy 

ratios  when  used  alone.  The  ratios,  together  with  their 

F-statistics  are  listed  in  Table  (V-l),  both  for  the  full 

set  of  earthquakes  and  explosions  and  for  the  reduced  set 

where  the  M  /m,  ratios  have  been  included.  The  F-statistics 
s  b 

may  be  used  to  test  the  null  hypothesis  that  the  two 
population  means  are  equal.  In  general,  the  large  values 
found  for  the  F-statistics  support  the  alternative 
hypothesis  that  the  two  population  means  are  unequal. 

Booker  and  Mitronovas  (1964)  claim  an  average  overall 
probability  of  correct  classification  of  0.85.  In  this 
study,  under  the  minimax  assumption  that  the  prior 
probability  of  an  observation  originating  in  either 
population  is  0.5,  the  average  probability  of  correct 
classification  is  0.79  where  the  entire  set  of  events  is 
used,  and  0.78  where  the  reduced  set  is  used.  Where  the 
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quadratic  procedure  is  used,  the  average  probability  of 
correct  classification  is  0.94  for  the  full  set  of  events 
and  0.98  for  the  reduced  set  of  events.  Because  of  the 
small  size  of  the  earthquake  sample,  however,  the  result 
obtained  for  the  reduced  set  of  events  by  the  quadratic 
procedure  should  probably  not  be  taken  as  representative  of 
the  true  probability  of  classification. 

In  addition  to  studying  the  discriminant  potential  of 
each  variable  when  considered  alone,  the  marginal 
contributions  of  the  variables  have  been  studied  using  the 
stepwise  algorithm  of  BMD07M.  In  the  case  of  the  energy 

ratios,  once  (Pg/Lg^)  has  been  entered  at  the  first  step, 

(P^/S^),  (Pg/B),  and  ( Pg  /  [Lg  j+Rg  1  ]  )  lose  much  of  their 

initial  discriminating  power  because  they  are  all  rather 
strongly  correlated  with  (Pg/Lg^).  As  a  result,  once 
(Pg/Lg^)  has  been  entered,  the  other  three  variables  provide 
little  additional  discriminating  information. 

The  estimated  Mg/m^  ratios  are  not  highly  correlated 

with  any  of  the  Booker  and  Hitronovas  energy  ratios. 

However,  this  may  not  be  true  for  observed  Mg/m^  ratios, 

because  the  Mg/m^  ratios  used  in  this  study  are  randomly 

generated  within  specified  variance  constraints  that 

characterize  several  M  /m,  observations  at  NTS. 

s  b 

On  the  other  hand,  this  study  shows  the  enormous  power 
of  the  Mg/m^  discriminant,  even  in  the  pessimistic  case 
where  the  wide  variance  of  Ms/m^  set  4  is  used.  Using  set 
4,  the  average  probability  of  correct  classification  is 


boosted  from  0.78,  where  only  the  short-period  energy  ratios 


are  used,  to  0.96.  This  probability  increases  to  nearly 

1.00  for  the  remaining  sets  of  M  /m,  ratios. 

s  b 

Another  useful  result  obtained  in  this  study  is  the 
successful  classifications  obtained  using  the  quadratic 
discriminant  analysis  procedure.  The  results  obtained  using 
this  procedure  are  found  to  be  at  least  as  good  (in  terms  of 
the  posterior  classification  probabilities)  as  those  from 
the  linear  procedures  for  the  original  set  of  events,  and 
perhaps  even  better  for  the  reduced  set.  Some  caution  must 
be  exercised,  however,  both  in  the  situation  where  the 
sample  sizes  are  small,  and  where  the  actual  distribution  of 
events  differs  from  the  assumption  of  normality.  If  a 
multivariate  model  is  not  appropriate  for  describing  the 
data,  misleading  classifications  might  be  obtained  for 
observations  located  at  points  corresponding  to  the  tails  of 
the  distributions  where  the  probability  densities  are  small. 
On  the  other  hand,  where  the  multivariate  normal 
distribution  is  a  valid  model,  the  quadratic  procedure 
provides  the  best  classification  if  the  covariance  matrices 
differ  markedly. 

An  obvious  flaw  exists  in  the  selection  of  events  for 
this  study.  As  shown  in  Table  (IV-1),  the  explosion  data 
« - :  is  drawn  primarily  from  events  at  NTS,  with  the 

•  .  -  i :  0  n  o  m  e  ,  which  was  detonated  near  Carlsbad,  New 

tv  -  earthquake  data  set  is  drawn  from 
-  ...  j  :  •  and  3  a  j  a  California.  As 
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a  result,  for  the  underground  explosion  population,  there  is 

a  great  deal  of  consistency  In  the  source  locations,  source 

depths,  and  travel  paths  to  stations.  The  earthquake 

population,  on  the  other  hand,  is  very  poorly  constrained  in 

all  these  respects.  This  suggests  that  the  discriminant 

analysis  results  presented  here  might  be  improved  a  great 

deal  by  using  a  more  controlled  earthquake  sample 

population.  This  suggestion  is  supported  by  the  fact  that 

the  Booker  and  Mitronovas  energy  ratios  and  the  M  /m, 

s  b 

discriminant  both  reflect  similar  source  properties.  Energy 

ratios  drawn  on  a  properly  constrained  set  of  earthquakes 

could  be  expected  to  discriminate  at  least  as  successfully 

as  the  M  / m,  ratios  did  in  this  study.  Data  for  a  set  of 

s  b 

properly  constrained  earthquakes  are  needed  to  test  this 
suggestion. 

In  addition  to  the  discriminants  employed  in  this 
study,  any  other  discriminant  may  be  readily  incorporated 
into  the  linear  discriminant  methods  outlined  here.  The 
stepwise  procedure  used  in  BMD07M  makes  this  approach 
particularly  attractive,  in  that  it  may  be  used  to  find 
those  sets  of  variables  where  each  one  is  a  powerful 
discriminant  alone,  and  where  each  one  is  maximally 
uncorrelated  with  the  other  variables  in  the  set.  Such  a 
set  of  nearly  independent  variables  provides  optimal 
classifications  because  errors  introduced  in  any  given 
variable  tend,  on  the  average,  to  be  compensated  by  the 


others  • 
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The  problem  of  missing  data  may  be  solved  quite  easily 
in  a  discriminant  analysis  scheme.  If  several  events  lack 
measurements  on  one  or  more  variables,  the  missing  values 
may  be  replaced  by  the  value  corresponding  to  the  point  of 
intersection  between  the  hyperplane  which  optimally 
separates  the  two  populations  and  the  line  connecting  the 
two  population  means.  This  point  may  be  found  by  generating 
a  discriminant  analysis  where  the  events  containing  missing 
data  values  have  been  omitted.  In  such  a  procedure,  the 
variables  corresponding  to  the  missing  values  do  not 
contribute  to  the  discriminant  analysis  for  events  with 
missing  data.  Instead,  these  events  are  classified  using 
only  the  variables  for  which  measurements  are  available. 


Cluster  Analyses 


From  the  results  obtained  above,  it  may  be  seen  that 
cluster  analysis  represents  a  less  useful  tool  for 
classification  of  seismic  events  than  linear  and  quadratic 
discriminant  analysis.  The  reasons  for  this  can  probably  be 
traced  to  the  fact  that,  in  the  case  of  the  discriminant 
analyses,  the  group  memberships  and  the  number  of  groups  are 
specified  as  constraints  on  the  problem.  The  purpose  of  the 
discriminant  analysis  is  to  provide  the  linear  or  quadratic 
function  which  best  separates  the  two  populations,  given  the 
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input  information  as  to  the  number  of  groups  and  the  sample 
memberships-  In  the  case  of  cluster  analysis,  no  such  prior 
information  is  available;  the  data  alone  must  suggest  the 
appropriate  partitioning  and  the  optimum  number  of  groups. 
In  this  way,  cluster  analysis  best  operates  as  a  hypothesis 
generator.  That  is,  where  no  prior  information  is  available 
about  the  data,  cluster  analysis  methods  might  be  used  to 
generate  information  about  intrinsic  group  structure.  This 
information  could  then  serve  as  the  input  to  a  discriminant 
analysis  scheme.  Such  a  strategy  is  of  little  use  in  the 
present  study,  however,  because  the  group  attributes  of  the 
data  are  already  known,  and  are  not  those  generated  by  the 
cluster  analyses. 

One  interesting  result  which  did  arise  from  the  cluster 
analyses  generated  in  this  study  is  the  tendency  for  the 
data  to  form  stable  groups  at  the  four-cluster  level.  No 
explanation  could  be  found  for  this  phenomenon.  No 
correlations  were  observed  between  the  cluster  formation  and 
the  event  source  locations,  average  source-receiver 
distances,  or  body  wave  magnitudes.  It  may  be  possible  that 
these  cluster  formations  result  from  similar  source  depths 
for  those  events  in  the  same  cluster,  or  from  some  effect 
which  may  be  traced  to  the  azimuthal  distribution  of 
stations.  However,  no  information  is  available  for  these 
events  concerning  either  the  stations  used  to  obtain  the 
energy  ratios  or  the  earthquake  focal  depths. 

Finally,  the  incorrect  classifications  obtained  using 

f 


cluster  analysis  in  all  but  the  case  where  the  M  /m.  ratios 

3D 

are  included  indicates  that  cluster  analysis  provides  useful 
groupings  only  where  the  b e t wee n-group  scatter  is 
significantly  greater  than  the  within-group  scatter,  or 
where  the  groups  are  highly  disjoint. 


S ugge s t ions  f or  Improving  D is cr iminat ion  Capabilities 
based  on  Recent  Advances  in  Seismic  Instumentation 
and  Signal  Process ing  Techniques 


Improved  discrimination  capabilities  may  be  provided  by 
utilizing  several  recent  advances  in  seismic  instrumentation 
and  signal  processing  techniques.  Because  new  discriminants 
are  readily  incorporated  into  the  framework  of  the 
discriminant  analysis  techniques  presented  in  this  paper,  it 
is  possible  to  construct  training  sets  of  data  and  to 
upgrade  them  at  a  later  time  as  new  discriminants  are 
developed . 

Seismic  instrumentation  has  been  greatly  improved  in 
recent  years.  A  primary  example  is  the  development  of  the 
Seismic  Research  Observatory  (SRO)  network  of  which  13 
stations  are  now  in  operation  world-wide.  The  instruments 


used  in  the  SRO  network  are  characterized  by  a  very  broad 
dynamic  range,  so  that  a  wide  magnitude  range  of  events  may 
be  recorded  without  altering  the  frequency  response 
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characteristics  of  the  system.  Because  the  SRO  seismometers 
are  situated  in  deep  boreholes,  exceptionally  high  signal  to 
noise  ratios  are  realized.  The  data  recorded  at  the  SRO 
stations  are  digitized  and  stored  on  magnetic  tape  volumes. 
This  compact  means  of  data  storage  allows  for  convenient 
retrieval  of  data  for  subsequent  processing.  In  particular, 
the  on-site  conversion  of  analog  data  to  digital  format 
allows  quite  sophisticated  data  processing  techniques  to  be 
applied  routinely  and  economically. 

A  seismic  network  such  as  the  SRO  network  should  prove 

invaluable  in  the  seismic  discrimination  problem.  One  of 

the  reasons  that  the  M  /m,  discriminant  has  been  so 

s  b 

successful  is  that  it  may  be  easily  and  routinely 
calculated.  The  digital  format  of  the  SRO  data  allows  more 
sophisticated  discriminants  to  be  calculated  as  routinely  as 
Mg/m^,  and  moreover,  because  such  measurements  may  be 
completely  automated,  the  introduction  of  human  error  in 
measurement  and  digitization  may  be  minimized. 

By  using  seismograms  from  a  system  such  as  the  SRO 
network,  it  is  possible  to  incorporate  discriminants  based 
on  frequency  domain  analyses  into  the  classification 
procedure.  Spectral  discriminants  have  been  suggested  as  a 
means  for  improving  discrimination  capabilities  (Weichert, 
1971;  SIPRI,  1968).  By  using  spectral  ratios  in  the 
discriminant  analysis  procedures  it  should  be  possible  to 
combine  the  effects  of  differences  in  energy  partitioning  at 
the  source  with  those  resulting  rom  different  source 
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dimensions  and  time  histories  for  the  two  types  of  events. 

A  facility  has  been  provided  to  calculate  these 
spectral  ratios  in  the  seismoprint  program  described  in 
Appendix  C.  The  seismoprint  program  generates  spectral 
analyses  at  two  different  levels.  At  the  first  level,  the 
program  obtains  a  spectrogram  for  the  event  being  studied. 
The  spectrogram  consists  of  a  two-dimensional  contour 
display  of  power  spectral  density  as  a  function  of  time  (or 
velocity)  and  frequency.  At  the  next  level,  spectra  within 
selected  time  (or  velocity)  windows  may  be  obtained  by 
averaging  the  spectra  used  to  obtain  the  event  spectrogram. 
In  addition,  these  spectra  may  be  generated  over  any 
frequency  range  of  interest  within  the  available  bandwidth. 
The  spectra  may  then  be  used  in  the  formation  of  spectral 
ratios  • 

An  area  which  shows  considerable  promise  is  the 
development  of  a  classification  scheme  based  on  the  event 
spectrogram.  This  is  especially  interesting  because 
spectral  information  in  the  entire  seismogram  could  be 
utilized  in  a  cluster  analysis  or  pattern  recognition 
scheme.  Spectrograms  are  shown  in  Figures  (V-l)  and  (V-2) 
for  the  Shoal  explosion  and  the  Fallon  earthquake.  The 
spectrograms  for  these  events  exhibit  remarkable  differences 
in  the  high  frequency  spectral  content  and  in  the  relative 
partitioning  of  energy  between  the  body  and  surface  wave 
radiation  fields. 

Application  of  pattern  recognition  techniques  to  the 
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event  spectrogram  has  received  little  attention  in 
seismology*  However,  in  light  of  the  possibility  of 
automating  a  pattern  recognition  scheme  using  digitized  data 
such  as  that  available  from  the  SRO  network,  further 
research  in  this  area  is  warranted. 
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APPENDIX  A 


THREE  CLASSIFICATION  CRITERIA  OFTEN  USED  IN 
DISCRIMINANT  ANALYSES 


Decision  Proba b ilitie  s  and  Cos t  Functions 


For  all  of  the  criteria  which  follow,  if  the  random 
vector  x  falls  within  some  region  R^  (determined  from  the 
criterion),  it  is  identified  with  the  population  A^.  It  is 
possible  to  formulate  various  decision  probabilities  in 
terms  of  integrals  of  the  appropriate  likelihood  functions 
over  the  regions  R^.  Let  D^_.  represent  the  decision  that 
the  event  in  question  originates  in  the  it^1  population  when 
it  actually  comes  from  the  population.  In  terms  of  the 
hypotheses  H^,  this  is  equivalent  to  choosing  when  is 
actually  true.  The  probability  of  making  this  decision  is 
given  by : 


P/t  f  dij  J 


Pjfi)  atV- 


(A  -  I) 


This  is  illustrated  for  the  error  probabilities  ?r{Dp)  and 
Pr{D2^>  for  the  two  group  univariate  case  in  figure  (A-l>. 
The  average  probability  of  correctly  classifying  tbe  r  \  •  ~ 
vector  x  is  given  by: 
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(A  -  2) 


Similarly,  the  average  probability  of  misclassifying  x  is 
given  by 


H  -  t  t 

l‘>  Jtl 


/  -  P e. 


(A  -  3) 


In  many  cases  it  is  desirable  to  weight,  or  assign 
costs  to  the  various  decisions.  If  denotes  the  cost 
associated  with  the  decision  D_,  the  average  cost  (i.e., 
the  cost  function)  associated  with  classifying  the  random 
vector  "x  is  given  by: 


1  J 

c  *  2.  >  Ci:  P*  fDiji  PsfHjl 

/•I  it,  7 


(A  -  4) 


In  general,  the  cost  associated  with  making  a  correct 
decision  is  set  equal  to  zero.  This  convention  will  be 
followed  here. 


The  Bayes  Criterion 

The  Bayes  criterion  dictates  that  x  be  classified  in 
such  a  way  that  the  cost  function  is  minimized,  which  is 
equivalent  to  determining  the  regions  so  that  C  is 
minimized.  For  the  two  population  case,  assuming  that  ■ 


C  *  C,t  Pa  f  0,»  }  Pa  }  Ml  /  *  Pa  (bn  f  PA  fft} 


(A  -  5) 


Letting  Pr{D21>  -  1  -  Pr{D^>,  and  Pr{H2>  -  1  -  PrtH^},  the 
average  cost  may  be  rewritten: 


C  »  Ct, 


t 


Cnf1'  P*  f»,i)  -  Ci, 


6) 


Because  is  the  only  variable  quantity  in  (A-6),  in  order 

to  minimize  C,  R^  must  be  chosen  to  include  only  those 

values  x  for  which 
o 

C,A  (i  -  P«  \H\)  ?*(**)  -  (A  -  7) 

This  result  may  be  stated  in  terms  of  the  likelihood  ratio 
test  for  Hj  as : 

Choose  H ^  if : 

P1(xo)/p2(*o)  >  C12(l-Pr{H1>)/C21Pr{H1>, 

choose  H2  otherwise. 

It  should  be  noted  that  if  C^2  -  and  «  C22  ■  0,  the 

average  cost  reduces  to  the  average  probability  of  error. 
This  special  case  of  the  Bayes  criterion  which  minimizes  P 


I 
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is  known  as  the  minimum  error,  or  maximum  a  posteriori 
probability  criterion. 


The  Neymann-Pearson  Criterion 

The  Neymann-Pearson  criterion  is  quite  useful  in  the 
two  population  case,  in  that  it  does  not  require  either 
costs  or  prior  probabilities.  Under  this  criterion,  the 
regions  R^  are  chosen  so  that  Pr{D  }  is  maximized,  while 
Pr{D^j >  is  set  to  a  specified  threshold  value,  say  z.  Since 
Pr{Dii>  »  1  -  PrtD^},  this  procedure  is  equivalent  to 

minimizing  PriD^).  Suppose  PrtD^J  is  set  at  Zm  Then 
Pr{D21>  is  to  be  minimized  subject  to  the  constraint: 
Pr{Di2>  =  z.  Introducing  the  Lagrange  multiplier^,  the 
quantity  to  be  minimized  is  given  by  Q,  where: 

()  t  /i«j  ♦  ^  J  (A  -  8) 

Expression  (A-8)  is  the  Bayes  cost  where  »  C22  "  0, 

C 1 2 ( 1  — P r { H  1  > )  «  A,  and  C2^Pr{H1>  *  1,  so  that  R ^  is  chosen 
such  that: 

/I  fi*(X  )  -  p,Cr)  *  o  (A  -  9) 


4 


or  in  terms  of  the  likelihood  ratio  test  for  H^: 
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Choos e  H ^  if : 

Pl(?o)/p2(V  *  A> 

choose  H£  otherwise* 


The  Minimax  Criterion 

Another  criterion  which  is  widely  used  where  the  prior 
probabilities  are  unknown  is  the  minimax  criterion,  which 
dictates  that  the  decision  regions  be  chosen  in  such  a  way 
as  to  minimize  the  maximum  average  cost.  That  is,  the 
minimax  criterion  implicitly  specifies  that  the  prior 
probabilities  be  chosen  to  correspond  to  the  maximum  on  the 
Bayes  cost  curve.  This  is  illustrated  in  figure  (A-2). 

If  the  Bayes  cost  is  viewed  as  a  function  of  PrtH^ 
with  the  and  Pr{D^j>  fixed,  it  has  a  shape  similar  to 

that  shown  in  figure  (A-2).  The  Neymann-Pearson  criterion 
has  the  effect  of  linearizing  the  average  cost.  The 
Neymann-Pearson  cost  is  quite  close  to  the  Bayes  cost  for 
values  of  PrO^}  close  to  A.  However,  as  Pr{Hi>  approaches 
and  exceeds  B,  the  Neymann-Pearson  cost  becomes  quite  large. 
An  alternative  is  to  choose  the  decision  regions  and 

so  the  maximum  possible  cost  is  minimized.  This  is  done  by 
the  minimax  criterion. 

The  maximum  on  the  Bayes  cost  curve  is  found  by 
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differentiating  the  Bayes  cost  with  respect  to  the  prior 
probability  Pr{H^>,  which  is  considered  as  an  independent 
variable,  The  result  is  then  set  to  zero: 

jj  1  f pfc.. 

*  Ci,  Pa  j J  -  C,t  Pa l &/ij  ~  10) 

r  O. 

The  condition  for  the  minimax  criterion,  then,  requires  that 
the  decision  regions  be  chosen  so  that 

C/2  Pa  f  P/l}  *  Cif  Pa  ft  ht  /  (A  -  11) 


The  minimax  cost  is  found  by  substituting  the  result  from 
( A- 11)  into  the  expression  for  the  Bayes  cost  given  in 
(A-6),  so  that  the  minimax  cost  becomes: 


C  --  Cz,  Pa  f  Pi,  j  ■  C,2  P*  f  b'l] 


(A  -  12) 


Because  C  determines  and  ^r^21^'  the  decision 

regions  and  are  those  regions  satisfying: 


i 


Pid)  dv 


•  /  />• 
'Kx 


p.(x)  dV. 


(A  -  13) 


Again,  this  result  may  be  expressed  in  terms  of  the 


likelihood  ratio  test  for  H^: 
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Choose  if: 

p1(x0)/p2(^2)  >  /tM, 

choose  otherwise, 

where  Au  is  chosen  to  satisfy  the  minimax  criterion.  The 
minimax  cost  is  independent  of  Pr(H^),  and  is  tangent  to 
the  maximum  on  the  Bayes  cost  curve  as  shown  in  figure 
(A-2 )  . 


The  minimax  cost  represents  the  best  criterion  where  no 
information  is  available  about  the  prior  probabilities 
Pr{H^}-  If  Pr{H^>  is  known  to  be  in  some  region  such  as  0  < 
PriH^}  <  B,  as  shown  in  Figure  (A-2),  then  the 
Neymann-Pearson  criterion  may  yield  a  smaller  average  cost, 
and  be  better  suited  to  the  problem.  However,  if  the  prior 
probabilities  are  known,  the  Bayes  criterion  always  yields 
the  minimum  costs. 

Each  of  the  criteria  discussed  above  leads  to  a  simple 
likelihood  ratio  test  in  the  two  population  case.  Multiple 
populations  may  be  as  easily  dealt  with  by  considering  the 
Bayes  cost  given  in  (A-6): 

J  J 

C  «  >  >  Cij  ?A  {&>j}  p*  fHjf.  ( A— 14) 

/'«/ 


The  Bayes  criterion  requires  that  this  average  cost  be 


minimized,  which  implies  chat  for  any  realization  of  x,  say 
x  ■  x  ,  the  hypothesis  chosen  must  be  the  one  which  yields 
the  lowest  average  cost.  The  cost  associated  with  is 

given  by: 

J  r  , 

Ci  »  b  Cji  {  (A  -  15) 

Pr{H^  |  x**xo  >  is  given  by  Bayes'  rule: 

/»/vr. )  A  / A  / 

fid.)  (A  -  16) 

/>;?*,)  A  f  //;  ( 

fifX.)  (A  -  17) 


A  f  f  *  **•} 

so  that 


i  •/ 


Since  the  marginal  density  of  x,  p(x),  is  common  to  all 
terms,  the  decision  rule  is  to  choose  the  hypothesis  which 
minimizes : 

J 

--  l  Cji  pi  ft.)  A  / Mil  (A  -  18) 

The  expression  above  is  the  one  derived  by  Whalen 
(1971).  An  equivalent  expression,  the  discriminant  score, 
is  given  by  Rao  (  1973  ).  Given  that  x  -  "x  q  , 


the  discriminant 
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APPENDIX  B 

THE  QUADRATIC  DISCRIMINANT  ANALYSIS  PROGRAM, 

QD I SCRIM 

Program  QDISCRIM  may  be  used  to  generate  a  quadratic 
discriminant  analysis  for  a  set  of  sample  vectors  from  as 
many  as  ten  groups.  The  maximum  number  of  variables  which 
may  be  included  in  the  analysis  is  also  limited  to  ten.  The 
maximum  number  of  sample  vectors  within  each  group  is 
limited  to  100.  QDISCRIM  classifies  the  input  observations 
on  the  basis  of  the  relative  magnitudes  of  the  equivalent 
discriminant  score  (Rao,  1973;  Whalen,  1971)  for  each  group. 
Each  sample  vector  is  classified  into  that  group  for  which 
its  equivalent  quadratic  discriminant  score  is  greatest. 
The  sample  vectors  are  assumed  to  have  a  multivariate  normal 
distribution  within  each  group. 

The  quadratic  procedure  has  the  advantage  that  no 
assumption  need  be  made  concerning  the  equality  of  the 
covariance  matrices  for  each  group.  This  assumption  is 
implicit  in  all  linear  discriminant  analysis  techniques.  On 
the  other  hand,  the  quadratic  procedure  relies  most  heavily 
on  the  assumption  of  normality  because  the  boundary  for  the 


classification  regions  often  coincides  with  areas  far  from 
the  means  of  the  respective  likelihood  functions,  where  the 


! 
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probability  densities  are  small  (Anderson  and  Badahur, 
1962).  Because  the  sample  covai lance  matrices  are 
calculated  for  each  group  individually,  caution  must  be 
exercised  to  make  sure  that  the  number  of  observation 
vectors  in  each  group  exceeds  the  number  of  components  in 
the  observation  vector  so  that  the  sample  covariance 
matrices  are  non- s ingu la r . 


Computational  Procedure 


Let  x^  ,  .  .  .  |  >  •  •  •  » j  ,  .  .  .  represent  observation 
vectors  from  g  groups,  where  is  the  number  of 
observations  in  the  ith  group.  The  first  step  in  QDISCRIM 
consists  of  estimating  the  mean  vectors  (x.^)  and  the  sample 
covariance  matrices  ( S  ^ )  for  each  group.  For  the  itl1  group, 
x^  and  are  given  by: 


Mi 

z .  L  \ 

*'  *  //;  ’ 
'  r' 


a-' 


(B  -  1) 


Because  the  sample  covariance  matrices  are  positive 
definite,  the  determinant  of  is  given  by 

/  /  *  7f  a;  . 

r-> 


(B  -  2) 


where  ^ j  is  the  j  eigenvalue  of  the  sample  covariance 
matrix  for  the  it'1  group.  The  eigenvalues  are  obtained 
using  the  Jacobi  method  employed  in  the  PSU  library 
subroutine  DMXDJA.  Finally,  each  sample  covariance  matrix 
is  inverted  using  the  PSU  library  subroutine  DMXINV. 

The  next  step  in  the  QD1SCRIM  procedure  involves 
evaluating  the  equivalent  quadratic  discriminant  score  for 
each  observation  vector,  and  classifying  the  vector  into 
that  group  for  which  the  quadratic  score  is  greatest.  This 
is  equivalent  to  classifying  the  observation  into  the  group 
which  maximizes  the  probability  that  the  hypothesis 
(i.e.,  that  originates  in  the  j1"'1  group)  is  true.  This 

probability  may  be  formulated  in  terms  of  the  likelihood 
functions  p^Cx)  by  applying  Bayes  rule: 

/P/YrJ  P*  / 


/> /'r.; 


(B  -  3) 


In  (B-3),  p_^(xQ)  is  the  likelihood  function  (the  conditional 

probability  density  function  of  x,  given  that  is  true) 

for  the  1th  population  evaluated  at  x  =  x  ,  where  x  is  a 
r  r  o  o 

specific  realization  of  the  random  vector  x  corresponding  to 
each  of  the  sample  vectors.  p(xQ)  is  the  marginal 
probability  density  of  x,  also  evaluated  at  xq.  PrOl^}  is 
the  prior  probability  that  xq  originates  in  the  i*"'1  group, 
and,  in  the  case  of  QDISCRIM,  may  be  specified  by  the  user. 
The  likelihood  functions  p^(x)  are  estimated  by: 


-92- 


f>,f  X)  *  (ir)  I  s  /  %  j  ~  i  (  X  -  X,  )  S ,  (  X  -  X, )  / 

M~*>  ,„)*TsTk  eyP  f’*'*-*^  ** 


(B  -  4) 


and  the  marginal  density  of  x  is  given  by: 

i 

pfx)  -  T  pi(X)  Pa  iff;]  (B  -  5) 

i-.i 

The  classification  strategy  used  in  QDISCRIM  involves 
identifying  with  tne  group  which  maximizes  the  right  side 
of  (B-3).  Since  p(x)  and  the  factor  involving  2  are  common 
to  all  groups,  they  may  be  deleted  from  the  comparison. 
Also,  because  the  natural  logarithm  is  a  strictly  increasing 
function,  it  is  sufficient  to  classify  xq  into  the  group 
which  maximizes: 


S'  a  (  Pst  f//,f )  -  £/ r-  ?,■  )Si'/ X- y,- ) -■£ /$/<  (B  -  6) 


The  expression  given  in  ( B  —  6 )  is  the  equivalent  discriminant 
score  (Rao,  1973;  Whalen,  1971).  For  each  sample  vector, 
QDISCRIM  evaluates  (B-6)  for  each  group,  then  classifies  the 
sample  vector  into  the  group  for  which  0  is  maximal. 
Finally,  QDISCRIM  evaluates  and  lists  the  classification 
probabilities  given  in  (B-3)  for  each  observation  vector  in 
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APPENDIX  C 


THE  SEISMOPRINT  PROGRAM 


The  seismoprint  program  is  a  comprehensive  spectral 
analysis  program  developed  to  analyze  seismic  signatures  in 
several  ways.  In  the  context  of  the  earthquake/underground 
explosion  discrimination  problem,  it  may  be  used  to  generate 
energy  ratios,  either  from  spectra  over  the  entire  bandwidth 
of  the  recording  instrument,  or  from  spectra  within  selected 
frequency  bands.  In  addition,  the  seismoprint  program 
generates  the  event  spectrogram,  the  seismological  analogue 
to  the  voiceprint  used  in  linguistics  analysis.  This 
suggests  that  pattern  recognition  techniques  based  on  the 
event  spectrogram  might  also  be  adapted  to  the 
discrimination  problem,  a  possibility  that  remains  largely 
unexplored . 


Program  Description 


The  seismoprint  program  consists  of  a  main  driver  and 
three  primary  subroutines.  The  main  driver  reads  the  input 
and  control  parameter  cards,  and  directs  control  to  each  of 
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Che  three  primary  subroutines  depending  on  the  tasks  the 
program  is  to  accomplish. 

Subroutine  SPXFRM  reads  the  input  seismogram  from  tape 
and  generates  the  power  spectra  of  consecutive,  possibly 
overlapping  segments  of  the  seismogram  at  uniform  time 
increments.  The  power  spectrum  at  each  time  point  is 
obtained  by  removing  the  mean  and  linear  trend  from  the 
desired  segment  of  data,  applying  a  cosine  taper  to  the  ends 
of  the  window,  then  extending  the  window  with  zeros  to  the 
specified  power  of  two.  The  finite  Fourier  transform  of  the 
data  is  calculated  using  the  Cooley-Tukey  fast  Fourier 
transform  algorithm,  then  the  periodogram  power  spectral 
density  estimate  is  generated  by  obtaining  the  squared 
modulus  of  the  finite  Fourier  transform.  The  periodogram 
estimate  may  be  smoothed  at  the  user's  option.  These  power 
spectra  are  then  transferred  to  disk  to  await  subsequent 
processing. 

Subroutine  SPCPLT  can  be  called  to  generate  the 
spectrogram  plot  in  one  of  two  possible  ways.  The  first 
option  provides  a  two-dimensional  printer  contour  plot  where 
representation  of  the  contour  intervals  is  achieved  by 
overstriking  selected  print  characters,  so  that  the  contour 
levels  are  indicated  by  varying  ink  densities.  The  second 
option  provides  a  hidden-line  perspective  plot  which  is 
generated  by  the  PSU  library  subroutine  HIDE.  In  both 
cases,  the  user  has  the  option  of  normalizing  the  power 
spectrum  at  each  time  index  with  respect  to  the  maximum  in 
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chat  spectrum.  This  type  of  display  is  quite  useful  for 
studying  multiple  events,  in  that  spectral  minima  tend  to 
persist  throughout  the  spectrogram,  whereas  other  effects 
are  randomized  by  the  normalization  process  (Bell,  1977). 

The  spectrogram  may  be  displayed  either  as  a  function 
of  time  and  frequency  or  as  a  function  of  velocity  and 
frequency.  Where  the  velocity  display  is  chosen,  the  event 
spectra  of  the  time  indices  closest  to  the  specified 
velocities  are  plotted  at  each  velocity  index.  Such  a 
display  introduces  a  scaling  effect  so  that  spectrograms  for 
events  with  different  source-receiver  distances  may  be 
directly  compared. 

Subroutine  SPSPLT  generates,  at  the  user's  option, 
plots  of  both  the  input  seismogram  and  power  spectra  of 
selected  time  and/or  velocity  windows.  The  power  spectra 
for  these  selected  windows  are  obtained  by  averaging  the 
spectra  of  the  uniformly  spaced  segments  of  the  seismogram 
that  are  generated  by  SPXFRM.  These  spectra  may  also  be 
saved  on  disk  or  tape  for  subsequent  processing,  and  may  be 
used  to  generate  spectral  ratios  for  a  seismic 
classification  scheme. 
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The  technique  used  in  subroutine  SPSPLT  to  obtain 
spectra  for  the  specified  time  and  velocity  windows  involves 
averaging  the  spectra  of  shorter,  possibly  overlapping 
windows  that  are  generated  by  SPXFRM.  This  is  one  procedure 
suggested  by  Cooley,  Lewis,  and  Welch  (1967)  to  obtain  a 
consistent  power  spectral  density  estimator. 

The  statistical  properties  of  both  the  periodogram  and 
weighted  covariance  power  spectral  density  estimators  are 
considered  in  some  detail  by  Blackman  and  Tukey  (1959), 
Cooley,  Lewis,  and  Welch  (1967),  Koopmans  (1974),  and 
Kanasewich  (1975).  Koopmans  (1974)  provides  a  detailed 
discussion  of  the  distribution  of  the  periodogram  spectral 
estimator  under  white  noise  assumptions,  and  the  asymptotic 
distribution  where  the  input  sequence  is  correlated  and 
non-Gausslan.  Koopmans  (1974)  and  Cooley,  et  al.  (1967) 
demonstrate  that,  under  fairly  weak  conditions  on  the  length 
of  the  data  and  the  smoothness  of  the  true  power  spectrum, 
the  asymptotic  distribution  of  the  periodogram  may  be  used 
as  a  reasonable  approximation  to  the  true  distribution. 
Under  these  assumptions,  the  periodogram  power  spectral 
density  estimates  are  approximately  independently  and 
identically  distributed  proportional  to  chi-square  with  two 
degrees  of  freedom,  except  at  the  frequency  points 


corresponding  to  ~&T »  0,  7f,  where  the  estimates  are 
distributed  approximately  proportional  to  chi-square  with 
one  degree  of  freedom. 

Although  bias  may  be  introduced  through  spectral 
leakage,  which  results  from  either  (or  both)  truncation 
effects  or  a  rapidly  varying  true  power  spectrum,  the 
periodogram  estimate  is  asymptotically  unbiased.  However, 
the  mean-squared  error  remains  constant  in  the  asymptotic 
limit  (a  consequence  of  the  asymptotic  chi-square 
distribution),  so  that  the  simple  periodogram  is  not  a 
consistent  estimator  for  the  power  spectral  density.  Two 
techniques  are  available  for  generating  consistent  spectral 
estimators  based  on  the  periodogram.  The  smoothed 
periodogram  is  discussed  in  detail  by  Koopmans  (1974).  The 
second  method,  that  of  spectral  averaging,  is  due  to  Cooley 
et  al.  ( 1967) . 

This  technique  for  obtaining  a  consistent  spectral 
estimate  is  illustrated  in  Figure  ( C  —  1 ) .  The  original  M 
sample  record  is  broken  into  K  sections,  where  each  section 
is  L  samples  long,  and  is  offset  from  the  previous  record 
section  by  D  samples.  Periodogram  estimates  are  calculated 
for  each  section,  and  the  spectral  estimate  is  obtained  by 
averaging  o^er  these  per iodograms . 

The  expression  for  the  variance  of  this  estimator 
involves  tne  lag  windows  used  to  weight  and  truncate  the 
series,  so  that  the  weights  w(j)  must  be  explicitly  included 
in  the  following  derivations.  In  the  case  where  the  series 
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is  simply  truncated,  these  weights  correspond  to  the  simple 
boxcar  window. 

The  modified  periodogram  for  the  pth  record  segment  is 
given  by: 


JTU  / 


(C  -  1) 


where 
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d.  w-V  j ) 
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(C  -  2) 


and  A^(n)  is  the  finite  Fourier  transform  of  the  pth 
windowed  record  segment,  given  by: 
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The  desired  spectral  estimate  is  given  by  the  average  over 
these  periodograms : 


p  tw.'i 


-  c'' 

JEp  (  Vm). 


(C  -  4) 


As  before,  the  1^(1^)  are  independently  and  identically 
distributed  proportional  to  chi-square  with  two  degrees  of 
freedom,  except  at  m0  and  tr  m  it  •  At  these  points  the 
Ip(*r)  are  asymptotically  distributed  proportional  to 
chi-square  with  one  degree  of  freedom.  Writing  1^(1^)  in 
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terms  of  the  cyclic  lagged  product  of  the  w(j)x(j)  series: 
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the  expected  value  of  p(ur^)  is  given  by: 
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where 
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If  the  true  power  spectrum  p  (*r)  is  effectively  constant 
over  the  bandwidth  of  h  (*f)  ,  E{p(»r)}  becomes: 


m 


**<•)}  ®  P  f  A^fir)  a/ ZJ- 

'-t r 

~  p(t~ >„)  t  S/’ret 


(C  -  8) 


which  shows  that  j7(%r)  is  approximately  unbiased  for 
sufficiently  large  L • 

The  variance  of  {?(■&£■)  is  quite  difficult  to  obtain  for 
small  L.  However,  for  sufficiently  large  L  (and  M) ,  the 


asymptotic  conditions  given  above  are  valid  to  a  good  degree 
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of  approximation  (Koopmans,  19  7  4)  •  The  variance  of  p(?^r)  is 
given  ^y: 
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Substituting  the  expression  for  I  (»^)  given  in  (C-4)  and 
rearranging : 

c»*s  f JQf /**•/ 

/ is*.)  J  -  (C  —  10) 

f  c->  l-/  trt  l;i 

'  yjrvV  r>:  >  >  *•>  W(l*A.)wf+)  Wfi*v)  X 

*•*  »»• 

X  ^  XffS  *mJ  X/t+jft)  Xp  tjft*  v)  J  £ 

For  sufficiently  large  L,  the  series  x^(j)  may  be  viewed  as 
approximately  Gaussian  and  independent.  This  corresponds  to 
p  (;w)  being  approximately  constant  over  hffl(v)  ,  and  1^(1^) 
being  asymptotically  distributed  proportional  to  chi-square. 
Under  these  conditions,  the  fourth  moment  in  (C-10)  may  be 
expanded.  First,  since  x^+j  (k)  «■  x^Ck+jD): 
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*  &t,  i  *  j D*  i/  &  t*M. ,  4+ jb  ) 

Substitution  of  the  delta-functions  in  ( C  —  1  1 )  into  (C-10) 
reduces  the  number  of  summations  from  four  to  two.  After 
carrying  through  the  algebra,  (C-10)  reduces  to: 
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Three  points  are  worth  noting  here.  In  the  case  where 

D  >  L,  w(k+1D)  -  0  for  k-0,...,L-l,  so  that  I  (ts)  and 

—  P  n 

Ip+j(*r^)  are  uncorrelated.  This  situation  occurs  where  the 

L-sample  sections  do  not  overlap.  The  second  point  concerns 

the  variance  of  the  I  ( ts- )  ,  which  may  be  found  from  (C-12) 

P  n 

by  setting  j  equal  to  zero: 
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This  is  indeed  the  result  found  above  for  the  simple 

per iodogram.  Finally  the  correlation  between  I^Csr)  and 

Ip+j('4^)  may  be  expressed  in  terms  of  the  w(k)  in  ( C  —  1 2 )  and 

(C-13).  Since  corr{I  (tr)I  ..(*/')}  is  given  by: 

p  n  p  +j  n 
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p(j)  may  be  written: 
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p(j)  may  now  be  substituted  into  (C-9),  so  that  var{p(*r)> 
becomes 


VA/i  /X*  t*1")  ( 


j*  +*  E  (i-  vK)r<i) J  (c  "  16) 


where  D  _>  L  (i.e.,  the  sections  do  not  overlap),  the 

2 

variance  of  p(jjr)  is  simply  ( 1  /K )  va  r  {  I  (  v  )  >  *  p  (&/j)/K. 
Because  this  variance  goes  to  zero  as  K  becomes  large,  p(j.r) 
is  a  consistent  estimator.  If  M  becomes  large  and  L  is  held 
constant,  then  K  also  becomes  large,  with  the  result  that: 
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//#*  vol*  ( £/u*)l  z  /•'**  -£■  -  0  (c  -  17) 

nt~9  *>  ■  '  J  k-*o*  £ 

Where  D  <  L,  the  series  involving  ^(j)  must  be 
included.  However,  because  the  |0(j)  decrease  quickly  with  j 
(Cooley,  et  al.  ,  1967),  the  consistent  property  of  the 
estimator  is  retained,  and  in  some  cases  enhanced.  For 
fixed  M  and  L,  the  presence  of  the  p(j)  in  the  numerator  of 
(C-16)  tends  to  inflate  the  variance.  On  the  other  hand, 
because  the  segments  overlap,  K  is  larger  here  than  it  is  in 
the  case  of  non-overlapping  sections,  which  results  in  a  net 
decrease  in  the  variance.  Finally,  it  should  be  noted  that 
for  sr  -  0  or  ^  =  T ,  the  usual  doubling  in  the  variance 
occurs  due  to  the  loss  of  a  degree  of  freedom  in  the 
approximating  chi-squared  distribution. 


tic  representation  of  the  classification  reg 
uadratic  discriminant  analysis  problem.  The 
groups  are  denoted  by  p.  and  p.,  the  classif 


0:  Earthquake 
X:  Explosion 


FIGURE  IV-6 :  Scatterplot  of  1st  canonical  variable  (Var.  I)  vs.  2nd  canonical 
variable  (Var.  2)  for  reduced  set  of  9  earthquakes  and  21  explosions.  7  ener 
ratios  were  used.  X  denotes  an  explosion,  Q  denotes  an  earthquake. 


FIGURE  IV-IO:  Scatterplot  of  1st  canonical  variable  (Var.  1)  vs.  2nd 
variable  (Var.  2)  for  reduced  set  of  9  earthquakes  and  21  explosions 
ratios  and  Ms/mb  set  4  were  used.  X  denotes  an  explosion,  Q  denotes 
earthquake . 


URE  (IV  -  12)  Scacterplot  at  four-cluster  level  of  full  set  (20  eq.,  27  ex 
of  events.  Ward's  method  was  used.  Arrows  indicate  earthquakes.  Pg/V-g  is 
plotted  horizontally,  and  P_/S.  is  plotted  vertically.  1 
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ENERGY  RATIOS  WITH  MS  /  MB.  SET  1.  3  VARIABLES  WARD’S  METHOD. 
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FIGURE  (IV  -  18)  Scatterplot  at  four-cluster  level  of  reduced  set  (9  eq.,  21  ex.) 
of  events  with  Ms/n>b  Included.  Ward's  method  was  used.  Arrows  Indicate 
earthquakes.  Mg/raf,  Is  plotted  horizontally,  Pg/Lg  Is  plotted  vertically. 
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FIGURE  (A  -  2)  Neyraann-Pear son  (l)T  minimax  (2),  and  Bayes  (3) 
costs  as  a  function  of  Pr<H^>.  A  and  C  are  the  values  of 
Pr{Hi>  for  which  the  Neymann-Pear son  cost  and  miniraax  cost 
are  tangent  to  the  Bayes  cost,  and  are  the  prior  pr obab i 1 1 t ies 
at  whicn  the  two  criteria  are  optimal.  For  Pr{Hi>  >  B,  the 
Neyraann-Pear son  cost  becomes  highly  suboptiraal. 
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1)  The  cluster  analysis  and  discriminant  analysis  indices  given  in 

columns  1  and  2  are  the  numbers  by  which  these  events  are  referred 
to  in  the  two  analyses. 
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TABLE  IV-4 

VELOCITY  WINDOWS  USED  IN  FORMING  ENERGY  RATIOS 


P1 : 

First  arrival 

to 

4 . 6  km/sec 

P2  ! 

First  arrival 

to 

4 • 9  km/sec 

Pg  : 

6.  2 

km/ sec 

to 

4.9 

km/sec 

S1  * 

4.6 

km  /  sec 

to 

2.5 

km/sec 

S  2  ! 

4.9 

km/ sec 

to 

2.0 

km/sec 

B  : 

4.9 

km/sec 

to 

3.6 

km/sec 

L®  1 1 

3.6 

km/ se  c 

to 

3.2 

km/sec 

Lg2: 

3.6 

km /sec 

to 

3.2 

km/sec  * 

1 : 

3.2 

km/  se  c 

to 

2.8 

km/ sec 

Kg2: 

3.2 

km/sec 

to 

2.8 

km/ sec  * 

Ratio 

1: 

P  /S 

1  1 

Ratio 

2: 

P„/S„ 

2  2 

Ratio 

3: 

Pg/B 

Ratio 

4: 

Pg/Lg  x 

Ratio 

5: 

Pg/ (Lg1+Rgl> 

Ratio 

6: 

Rg^Lgj 

Ratio 

7: 

Rg2/L82 

The  energy  within  the  windows  Lg  and  Rg 
using  only  the  short-period  vertical  and^ 
components.  For  the  rest  of  the  windows, 
short-period  components  were  used. 


was  compute 
radial 
all  three 


141 


TABLE  IV-5 

BMD07M  scepwlse  linear  discriminant  analysis  summary. 
Full  event  set  (20  earthquakes.  21  explosions). 

7  energy  ratios 


STEP  (IDS bee  0 

TABIA  BLE  EITSBED 

TARIABLES  EOT  INCLUDED  ARO  P  TO  EITEB  -  DEGREES  OP  PRBEDOB  1  S5 
1  15.5225  2  6.9111  3  16.5384  4  20.5477 

5  12.3392  6  0.3182  7  0.0149 


STEP  ROHBE8  1 

TARI ABLE  ERTEBED  4 

TARIABLES  IRCLBDED  AND  P  TO  BESOTS  -  DEGREES  OP  FBEEDOB  1  45 

4  20.5477 

TARIABLES  ROT  INCLUDED  ARD  P  TO  EBTEB  -  DEGREES  OP  PRBEDOB  1  44 

1  0.5601  2  7.1432  3  0.6954  5  4.8120 

6  1.0277  7  0.0614 

NOHBEF  OF  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  EX 

GBOOP 

Q  EQ  18  2 

X  EX  12  15 


STEP  N0BBE8  2 

TABI ABLE  ERTEBED  2 

TARIABLES  IBCLODED  AND  P  TO  BEBOTE  -  DEGREES  OP  PREEDOB  1  ** 

2  7.1432  4  20.5779 

TARIABLES  ROT  INCLUDED  ARD  P  TO  BRTEB  -  DEGREES  OP  PREEDOB  1  43 

1  1.2729  3  4.0813  5  0.9350  6  -0.0000 


2 


T 


TABLE  IV-5  (cont'd.) 


STEP  BOHBEB  3 

VARIABLE  EBTEBED  3 

TABIABLES  INCLODED  AMD  F  TO  BEHOVE  -  DEGREES  OP  FBEEDOH 


2  10.8734 


4.0813 


4  14.3201 


VABIABLES  HOT  IBCLODED  AMD  F  TO  ESTER  -  DEGREES  OF  FBEEDOH 


1  0.4096 


1.3209 


0.0000 


0.4472 


GBOOP 
Q  BO 
Z  EX 


STEP  BOHBEB 
VARIABLE  EBTEBED 


NOHBER  OF  CASES  CLASSIFIED  IHTO  GBOOP 
Q  EQ  X  EX 


VARIABLES  IBCLODED  AMD  F  TO  REHOVE  -  DEGREES  OF  FBEEDOH 


5.3338 


4.4233 


4  11.4013 


1.3209 


VARIABLES  MOT  IBCLODED  AMD  F  TO  ESTER  -  DEGBEES  OF  FBEEDOH 


1.0757 


1.2486 


.  7  1.5024 


GBOOP 
Q  EQ 
X  EX 


MOHBEB  OF  CASES  CLASSIFIED  IBTO  GBOOP 
Q  EQ  X  EX 


STEP  MOHBER  5 

VARIABLE  EMTEBED  7 

VARIABLES  INCLODED  AMD  F  TO  BEHOVE  -  DEGREES  OF  FREEDOH 


5.7C16 


7  1.5324 


5.2738 


4  12.9951 


5  2.3773 


VARIABLES  NOT  INCLODED  AND  F  TO  ENTER  -  DEGREES  OF  FREEDOH 


1  40 


1  0.8992 


6  0.2499 


GBOOP 
Q  BO 
X  EX 


MOHBEB  OF  CASES  CLASSIFIED  IBTO  GBOOP  - 
Q  EQ  X  EX 


I  I 

f  I 

t  I 
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TABLE  I V-5  (cont'd.) 


STEP  BOHBER  6 

VARIABLE  ENTERED  1 

T BUBBLES  INCLUDED  AND  P  TO  BEHOVE  -  DEGREES  OP  PBEEDOB  1  40 


1  0.8992 


5  2.9858 


2  6.5647 


7  1.3140 


3  •  3.8146 


4  8.4569 


VARIABLES  NOT  INCLUDED  AND  P  TO  BNTBE  -  DEGREES  OP  PREEDOH  1  39 


6  0.3444 


BOBBER  OP  CASES  CLASSIFIED  INTO  GROUP  - 
9  10  I  El 


GROUP 
Q  EQ 
Z  EZ 


STEP  BOBBER  7 

VARIABLE  ENTERED  6 

VARIABLES  INCLUDED  AND  P  TO  REBOVE  -  DEGREES  OP  PREEDOH 


0.9790 


2.8710 


6.5264 


0.3444 


3.5723 


0.3385 


4  6.8842 


SBOOP 
Q  BO 
Z  EZ 


BOBBER  OP  CASES  CLASSIFIED  INTO  GROUP  - 
0  EQ  Z  EZ 
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TABLE  I V-6 

BMD07M  stepwise  linear  discriminant  analysis  summary. 
Reduced  event  set  (9  earthquakes,  21  explosions). 

7  energy  ratios 


STB?  RUBBER  0 

TABIABLB  EITEBED 

TABZABLES  1ST  INCLUDED  AND  P  TO  ENTER  -  DEGBEES  OP  PBEEDON  1  28 


1  6.40S2 

2 

2.5703 

3 

6.7749 

4  9.2516 

S  6.2126 

6 

0.0759 

7 

1.2271 

STEP  BOHBEB 

1 

TABIABLB  ENTERED 

4 

TABIABLBS  INCLUDED 

AND 

P  TO  RENOTE  -  DEGREES  OP  PBEEDOH  1 

28 

4  9.2516 

TABZABLES  ROT  INCLUDED 

AND  F  TO  ENTEB  -  DEGREES  OP  PBEEDOH 

1  27 

1  0.7125 

2 

3.4369  3  0.1491  5 

1.2782 

6  0.0040 

7 

0.3888 

BOHBEB  OP  CASES  CLASSIFIED  INTO  GBOUP  - 

Q  IQ  Z  EZ 

GBOUP 

0  EQ  8  1 

Z  EZ  7  14 


STEP  BOBBER  2 

TABIABLB  ENTERED  2 

TABZABLES  ZNCLUDED  AND  F  TO  REBOTE  -  DEGBEES  OP  FBEZDOB  1  27 

2  3.4369  4  10.0890 

TABZABLES  ROT  INCLUDED  AND  P  TO  ENTE3 
1  0.0043  3  3.9308 

7  1.2887 

NQflBEB  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  Z  EZ 


-  DEGBEES  OF  PBEEDON  1  26 

5  0.2950  6  0.1988 


GBOUP 
Q  EQ 
Z  EZ 


8 

4 


1 

17 


TABLE  IV-6  (cont'd.) 


STEP  BOBBER  3 

VARIABLE  EITB1ED  7 

VARIABLES  IRCLODED  AHD  P  TO  BEHOVE  -  DEGREES  OP  FBESDOB 


*.3256 


4  10.8487 


1.2887 


VABI ABIES  HOT  IRCLODED  ARD  P  TO  EHTER  -  DEGREES  OP  PBEEDOB 


0.0161 


1.6678 


1.1719 


GBODP 
Q  EQ 
Z  EX 


HOBBEB  OF  CASES  CLASSIFIED  IBTO  GBODP  - 
Q  EQ  X  EX 


0. 1983 


STEP  ROBBER  « 

VARIABLE  ERTERZD  3 

VARIABLES  IRCLODED  ARD  F  TO  BEHOVE  -  DEGREES  OP  FREE DOB 


5.8284 


1.6677 


6.4002 


2.0222 


VARIABLES  ROT  IRCLODED  ARD  P  TO  ERTER  -  DEGREES  OP  PBEEDOB 


0.0545 


2.  1654 


0.5121 


GBODP 
Q  EQ 
X  EX 


ROBBER  OF  CASES  CLASSIFIED  IHTO  GBODP  - 
Q  EQ  X  EX 


STEP  BOBBER  5 

VARIABLE  ERTEBED  5 

VAEIABLES  IRCLODED  ARD  F  TO  REHOVE  -  DEGREES  OP  PBEEDOB 


4.4566 


2.6612 


7.2318 


1  2* 


2.165* 


3.793* 


VARIABLES  ROT  IRCLODED  ARD  P  TO  ERTER  -  DEGREES  OP  PBEEDOB 


1  23 


1  0.0652 


0.0910 


GBODP 
Q  EQ 
X  EX 


HOBBEB  OF  CASES  CLASSIFIED  IBTO  GROOP  - 
Q  EQ  X  EX 
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TABLE  I V-6  (cont'd.) 


STEP  IOHBER  6 

YABIABLE  EHTEBED  6 

V All ABLE S  IMCLODED  AMD  P  TO  BEN09E  -  DEGBEES  OP  FREEDOM  1  23 

2  *.1686  3  2.5076  «  *.7277  5  1.6*85 

6  9.0910  7  1.678* 

TAHXABLES  SOT  IBCLODED  AMD  P  TO  BATES  -  DEGBEES  OP  PBEEOOH  1  22 

1  0.1020 


■OBBBB 

OF  CASES  CLASSIFIED  INTO  GBOOP  - 

6BOOP 

Q  BO 

X  EX 

0  BO 

8 

1 

X  BX 

2 

19 

STEP  BUHBEB 

7 

PABIABLE  ESTEBED 

1 

PABIABLES  IBCLODED 

AMD  F  TO  HBfIOTB  - 

DEGBEES 

OF  PREBDCJH 

1 

22 

1  0.1020 

2  3.8732 

3 

2.3975 

• 

*.1866 

5  1.6523 

6  0.1267 

7 

1. *518 

SOHBBB  OP  CASES  CLASSIFIED  IHTO  GBOUP  - 

g  IQ  I  EX 


6100P 
Q  BQ 
X  BX 


9 

2 


0 

19 


-14  7 


TABLE  IV-7 

BMD07M  stepwise  linear  discriminant  analysis  summary. 

Reduced  event  set  (9  earthquakes,  21  explosions). 

7  energy  ratios  and  M  /  m.  ,  set  1. 

s  b 


STEP  HOUSES  0 

VARIABLE  ENTERED 

TABI ABLES  NOT  INCLUDED  AND  P  TO  BNTEB  -  DEGBEES  OP  PBEEDOH  1  28 


1 

6.4083 

3 

6.7749 

5 

6.2127 

7 

1.2271 

2 

2. 5704 

4 

9. 2518 

6 

0.0760 

8 

214.5560 

STEP  HOBBER 

1 

VARIABLE  ENTERED 

8 

VARIABLES  INCLUDED 

AND 

P  TO  REROVE  - 

DEGREES  OF  FREEDOR  1 

28 

8  214.5560 

VARIABLES  NOT  INCLODED 

AND  P  TO  ENTER 

-  DEGREES  OP  FREEDOR 

1  27 

1  4.2278 

2 

1.4312 

3  1.1862  4 

5.3785 

5  3.7515 

6 

0.4996 

7  3.8333 

HOBBEB  OP  CASES  CLASSIFIED  INTO  GRODP  - 
Q  EQ  X  EX 

GROUP 

Q  EQ  9  0 

X  EX  0  21 


STEP  BOBBER  2 

VARIABLE  ENTERED  4 

VARIABLES  INCLODED  AND  P  TO  HEROVE  -  DEGREES  OP  FREEDOR  1  27 


4  5.3785 

8 

183.8242 

VARIABLES  NOT 

INCLUDED 

AND  P  TO  ENTER 

-  DEGBEES  CP  PREEDOR 

1  26 

1  0.0965 

2 

2.4298 

3  0.9868  5 

0.5922 

6  0.2065 

7 

2.7150 

HOBBEB  OP  CASES  CLASSIFIED  INTO  GBOOP  - 
Q  EQ  X  EX 


GBOOP 
Q  EQ 
X  EX 


9 

0 


0 

21 


TABLE  I V-7  (cont'd.) 


STEP  BOMBER  3 

VARIABLE  ENTERED  7 

VARIABLES  INCLUDED  AND  F  TO  BEHOVE  -  DEGREES  OP  FREEDOM  1 

*  4. 1540  7  2.7150  8  1S5.0327 

VARIABLES  NOT  INCLUDED  AND  P  TO  EBTEB  -  DEGBEIS  CP  PBEEDOH 
1  0.6282  2  4.7524  3  0.5917  5 


6  0.9729 


NUBBEB  OP  CASES  CLASSIFIED  INTO  GBCOP  - 
Q  EQ  I  EZ 

6100  P 

Q  EQ  9  0 

X  EZ  0  21 


STEP  NUBBEB  4 

VARIABLE  ENTERED  2 

VARIABLES  INCLUDED  AND  F  TO  BEBOVE  -  DEGREES  CF  FREEDOM  1  25 

2  4.7524  4  8.8428  7  5.0509  8  191.8542 

VARIABLES  NOT  INCLUDED  AND  P  TO  ENTER  -  DEGREES  CP  FREEDOM  1  24 

1  0.0238  3  0.0003  5  1.8229  6  0.5786 


ROBBER  OP  CASES  CLASSIFIED  INTC  GROUP  - 
Q  EQ  X  EX 

6ROOP 

9  10  9  0 

X  EX  0  21 


STEP  NUMBER  5 

VARIABLE  ENTERED  5 

VARIABLES  INCLUDED  AND  P  TO  SEBOVE  -  DEGREES  CP  FREEDOM 
2  3.2664  4  7.7013  5  1.8229 


8  189.9639 


VARIABLES  NOT  INCLUDED  AND  P  TO  ENTER  -  DEGREES  CP  PBEEDOH  1  23 

1  0.4217  3  0.1677  6  0.0311 

NUBBEB  OP  CASES  CLASSIFIED  INTO  GBCOP  • 

Q  BO  X  EX 

GROUP 

0  EQ  9  0 

X  EX  0  21 


1  24 

7  6.9314 


26 

1  25 

3.1773 


-1 


TABLE  I V-7  (cont'd.) 


STEP  10H  BER  6 

TABIABLS  EHTEE  ED  1 

TABIABLES  INCLUDED  AMO  F  TO  REnOTB  -  DEGREES  OP  PBEEDOR 
1  0.1217  2  3.5950  4  5.4019 

7  6.6603  8  185.2673 


1  23 

5  2.1758 


TABIABLES  HOT  INCLOSED  AMD  T  TO  ENTER  -  DEGREES  CF  FREEDOR  1  22 

3  0.1637  6  0.0990 

BOBBER  OP  CASES  CLASSIFIED  INTO  GRCOP  - 
Q  EQ  X  EX 

9  0 

0  21 


STEP  ROBBER  7 

TASI ABLE  ENTERED  3 

TABIABLES  IHCLODED  AMD  F  TO  BESOTS  -  DEGREES  OF  PBEEDOfl  1  22 

1  0.4067  2  3.5936  3  0.1637  4  5.3094 


5  2.2603  7  6.3169  8  158.6258 

TABIABLES  NOT  IMCLODED  AND  F  TO  EHTeS  -  DEGEEFS  CF  FEEEDOB  1  21 

6  0.0879 

BOBBEB  OP  CASES  CLASSIFIED  INTO  GBOUP  - 
0  EQ  X  EX 

GHOOP 

Q  BQ  9  0 

X  EX  0  21 


STEP  BOBBEB  8 

TABIABLE  ENTERED  6 


TABIABLES  IHCLODED  AND  P  TO  BEBOTE  -  DEGREES  OP  PBEEDOB  1  21 

1  0.4524  3  0.1497  5  1.6573  7  3.0042 

2  3.4728  «  3.9643  6  0.0879  8  151.1451 


BOBBER  OP  CASES  CLASSIFIED  INTO  GBOOP  - 
Q  EQ  X  EX 

GBOOP 

Q  EQ  9  0 

X  EX  0  21 


GBOOP 
Q  BQ 
X  EX 


TABLE  I V-8 


BMD07M  stepwise  linear  discriminant  analysis  suanar 
Reduced  event  set  (9  earthquakes,  21  explosions). 
7  energy  ratios  and  M  /■  set  2. 


STEP  NOBBEB  0 

VARIABLE  EHTEBED 


TABX1BLES  NOT 

INCLODED 

AHD  P  TO  ENTEB 

-  DEGBIIS  Or  PBEEDOB 

1  28 

1  6.4083 

3 

6.7749 

5  6.2127 

7 

1.2271 

2  2.5704 

4 

9.2518 

6  0.0760 

8 

115.5020 

STEP  NOBBEB  1 

VARIABLE  ENTERED  8 

VARIABLES  INCLODED  AND  F  TO  REHOVE  -  DEGREES  OP  PBEEDOR  1  26 


8  115.5020 

VARIABLES  HOT  IHCLDDED  AHD  F  TO  ENTEB  -  DEG8HES  OF  PBEEDOB  1  27 


1 

12.1118 

2 

8.5479 

3 

10.1405 

4  8.2551 

5 

9.3989 

6 

1.7685 

7 

0.3380 

bobbeb 
Q  EO 

OP  CASES  CLASSIFIED 
X  EX 

INTO 

GBOOP  - 

GBOOP 


Q  EQ  9  0 

X  EX  0  21 


STEP  I0BBEB  2 

VARIABLE  EHTEBED  1 


VARIABLES  INCLODED 

AND 

P  TO  BEHOVE  -  DEGSEES 

OP  PBEEDOB  1 

27 

1  12.1118 

a 

136.1186 

▼ABXABLES  HOT  INCLODED 

AND  P  TO  ENTEB 

-  DEGSEES  OP  PBEEDOB 

1  26 

2  0.0628 

3 

0.6956 

4 

1.0353  5 

0.1996 

6  0.6064 

7 

0.0315 

BuflBBI 

0  BQ 

GBOOP 

Q  BQ  9 

X  EX  0 

OP  CASES  CLASSIFIED 
X  EX 

0 

21 

INTO 

GBOOP  - 

151 


TABLE  I V-8  (cont'd.) 


STEP  ROBBER  3 

VARIABLE  ENTERED  4 

VARIABLES  IRCLODED  AND  P  TO  REBOVE  -  DEGREES  OP  FBEEDOfl  1  26 

1  3.9929  4  1.0353  8  120.9839 

VARIABLES  ROT  INCLUDED  AMD  P  TO  ENTER  '  DEGREES  OP  PBEEDOR  1  25 

2  0.0021  3  1.2139  5  0.0228  6  0.3319 


7  0.2162 


ROBBER  OF  CASES  CLASSIFIED  IHTO  GROUP  - 
Q  EQ  Z  EX 

OIOO  P 

Q  EQ  9  0 

X  EX  0  21 


STEP  ROBBER  4 

VARIABLE  ENTERED  3 

VARIABLES  IRCLODED  AND  F  TO  REHOVE  -  DEGREES  OP  PREBDOH  1  25 

1  3.2136  3  1.2139  4  1.5474  8  121.5642 


VARIABLES  ROT  IRCLODED  AMD  P  TO  ENTER  -  DEGREES  OP  FREEDOM  1  24 

2  0.1019  5  -0.0002  6  0.3314  7  0.0544 

ROBBER  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  IQ  X  EX 

GROOP 

Q  EQ  9  0 

X  EX  0  21 


STEP  BOBBER  5 

VARIABLE  ENTERED  6 

VARIABLES  IRCLODED  AND  P  TO  BEHOVE  -  DEGREES  OP  PREEDOB  1  24 

1  2.4207  3  1.1784  4  1.1714  6  0.3314 


8  118.4616 


VARIABLES  ROT  INCLODED  AND  P  TO  ENTER  -  DEGREES  CP  PREEDOB  1  23 


2  0.1690  5  0.4007.  7  0.8938 


ROBBER  OP  CASES  CLASSIFIED  INTO  GBCOP  - 
Q  BO  X  EX 

6ROOP 

Q  10  9  0 

X  EX  0  21 
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TABLE  I V-8  (cont'd.) 


STD  HOBBES  6 

V ASX  ABLE  ENTERED  7 

VARIABLES  XHCLOOED  AND  F  TO  BSNOVB  -  DESBEES  OP  PBEEDOB  1 

1  3.0411  3  0.5367  4  1.3800  6 

7  0.6938  8  110.5479 


VARIABLES  NOT  INCLUDED  AND  P  TO  ENTER  -  DEGREES  OP  PBEEDOB 

2  0.0344  5  0.5757 

R08BEB  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  Eg  X  EX 

GBOOP 

Q  EQ  9  0 

X  EX  0  21 


STEP  BOBBEB  7 

VARIABLE  ENTERED  5 

VARIABLES  INCLUDED  AND  P  TO  BEBOVB  -  DEGREES  CP  PBEEDOB  1  22 

1  3.5437  3  0.7102  4  0.0343  5  0.5757 


6  1.7097  7  1.0514  8  100.5857 

VARIABLES  HOT  INCLUDED  AND  P  TO  ENTEB  -  DEGBEES  OP  PBEEDOB  1  21 

2  0.0534 

BOBBEB  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  EX 

GBOOP 

Q  EQ  9  0 

X  EX  0  21 


23 

1.1689 

1  22 


STEP  BOBBEB  8 

VARIABLE  ENTERED  2 


VARIABLES  INCLUDED 

AND  F  TO  BEHOVE  - 

DEGBEES  CP  PBEEDOB 

1 

21 

1  3.3340 

3  0.7310 

5  0.5706 

7 

0.8444 

2  0.0534 

4  0.0182 

6  1.6286 

8 

78.7544 

RUBBER  OP  CASES  CLASSIFIED  INTO  GBOOP  - 
0  EQ  X  EX 

GROUP 

Q  IQ  9  0 

X  IX  0  21 
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TABLE  IV-9 

BMD07M  stepwise  linear  discriminant  analysis  summary. 
Reduced  event  set  (9  earthquakes,  21  explosions). 

7  energy  ratios  and  Mg/n^,  set  3. 


STEP  BOBBER  0 

VARIABLE  ENTERED 


VABIABLBS  NOT 

INCLUDED 

AND  P  TO  ENTER 

-  DEGREES  OP 

PREEOOB 

1  28 

1  6.4083 

3 

6.7749 

5  6.2127 

7 

1.2271 

2  2.5704 

4 

9.2518 

6  0.0760 

8 

74.7827 

STEP  BBHBEg  1 

TABIABLE  EBTEBED  8 

VARIABLES  IHCLODED  AND  V  TO  BEHOVE  -  DEGREES  OP  PREEDOH  1 
8  7*.  7827 

VARIABLES  NOT  INCLUDED  AND  P  TO  ENTER  -  DEGREES  OF  PREEDOH 
1  14.2533  2  7.2224  3  19.5475  4 


5  11.5038  6  0.0014  7  0.1908 


ROBBER  OP  CASES  CLASSIFIED  INTO  GBCOP  - 
Q  EQ  X  EX 

GROUP 

0  EQ  9  0 

X  EX  1  20 


STEP  ROBBER  2 

VARIABLE  ENTERED  3 

VARIABLES  IHCLODED  AND  P  TO  BEHOVE  -  DEGREES  OP  PREEDOH  1  27 


3  19.5476  8  110.5786 

VARIABLES  NOT  IHCLODED  AND  P  TO  ENTER  -  DEGREES  OP  PREEDOH  1  26 


1 

0.6249 

2 

9.2939 

4 

0.4700 

$ 

0.0675 

6 

0.2538 

7 

0.6005 

ROBBER 
Q  EQ 

OP  CASES  CLASSIFIED 
X  EX 

INTO 

GROUP  - 

28 

1  27 

14.0003 


GROOP 
Q  EQ 
X  IX 


9 

0 


0 

21 


I 


-154- 


TABLE  IV-9  (cont'd.) 


STEP  IOHBEB 
VARIABLE  ENTERED 


VARIABLES  INCLUDED  AND  F  TO  RENOTE  -  DEGREES  OF  FREE DOB 


0.6249 


4.0418 


8  106.8292 


VARIABLES  NOT  INCLUDED  AND  F  TO  ENTER  -  DEGRESS  OF  FREBDOB 


1  25 


3.2717 


0.0001 


0.7387 


0.4122 


1.2722 


GROUP 
Q  EQ 
Z  EZ 


ROBBER  OF  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  I  EX 


STEP  BUBBEB  4 

VARIABLE  ENTERED  2 


VARIABLES  INCLUDED  AND  F  TO  BEHOVE  -  DEGREES  OF  FREBDOB  1  25 


1  3.6276 


2  3.2717 


3  6.0983 


8  104.8908 


VARIABLES  NOT  INCLUDED  AND  F  TO  BNTEB  -  DEGREES  OF  FREEDOB  1  24 


4  0.0173 


5  0.4550 


6  0.1527 


7  0.5616 


GROUP 
Q  EQ 
I  BZ 


RUBBER  OF  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  BZ 


STEP  RUBBER 
VARIABLE  ENTERED 


VARIABLES  INCLUDED  AND  F  TO  BEHOVE  -  DEGREES  OF  FREBDOB 


3.  7354 


2  2.4308 


3  4.3818 


0.5616 


8  97.4571 


VARIABLES  ROT  INCLUDED  AND  F  TO  ENTER  -  DEGREES  CP  FREEDOB 


1  23 


4  0.0050 


0.2534 


0.0419 


GROUP 
0  >Q 
Z  BZ 


ROBBER  OF  CASES  CLASSIFIED  XNTC  GROUP  - 
Q  EQ  I  EZ 


TABLE  IV-9  (cont'd.) 
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ST  BP  BOBBER  6 

TAMABLE  ENTERED  5 

TABIABLES  INCLODED  AND  P  TO  8EB0TE  -  DEGREES  OP  TBEEDOB  1  23 

1  2.7436  2  2.2458  3  4.4862  5  0.2534 


7  0.3548  8  94.6508 

VARIABLES  NOT  INCLODED  AND  P  TO  ENTER  -  DEGREES  OP  FREEDOM  1  22 

4  0.0945  6  0.2211 

BOBBER  OP  CASES  CLASSIFIED  INTO  GFCur  - 
Q  EQ  X  EX 

GROOP 

Q  BO  9  0 

X  EX  0  21 


STEP  BOBBER  7 

VARIABLE  ENTERED  6 

VARIABLES  INCLODED  AND  P  TO  8BNOVE  -  DEGBEES  CP  PRES DOB  1  22 

1  2.8315  2  2.0756  3  4.2000  5  0.4251 

6  0.2211  7  0.5570  8  86.6240 


VARIABLES  NOT  INCLODED  AND  P  TO  ENTER  -  DEGREES  OP  PREEDOB  1  21 

4  0.4590 

BOBBER  OP  CASES  CLASSIFIED  INTO  GROOP  - 

Q  EQ  I  EX 

GROOP 

0  EO  9  0 

X  EX  0  21 


ST BP  BOBBER  8 

VABIABLE  ENTERED  4 


VARIABLES  INCLODED  AND  P  TO  REBOVE  -  DEGBEES  OP  PREEDOB  1  21 


2.  2404 

3  4.0976 

5 

0.8714 

7 

0.5601 

2.2193 

4  0.4590 

6 

0.5819 

8 

68.0162 

BOBBER 

OP  CASES  CLASSIFIED 

INTO 

GROOP  - 

Q  sg 

X  EX 

GROOP 

g  EQ  9  0 

X  EX  0  21 
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TABLE  IV-10 

BMD07M  stepwise  linear  discriminant  analysis  summary* 
Reduced  event  set  (9  earthquakes,  21  explosions)* 

7  energy  ratios  and  Ma/m^,  set  4. 


STEP  iobbeb  0 

VABXABLE  ENTERED 


VARIABLES  MOT  INCLUDED  AND  F  TO  ENTE8  -  DECREES  OF  PBEEDOR  1  28 


1  6.4083 

2  2.5704 


3  6.7749  5  6.2127 

4  9.2518  6  0.0760 


7  1.2271 

8  43.6063 


STEP  IOBBEB  1 

V All ABLE  ENTERED  8 


T1IX1BLES  INCLUDED  AND 

F  TO  REMOVE  - 

DEGREES  OF  FREEDOM  1 

28 

8  43.6063 

VARIABLES  NOT 

INCLUDED 

AND  F  TO  ENTER 

-  DEGREES  OF  FREEDOM 

1  27 

1  6.2522 

2 

4.  1747 

3  12.8523  4 

7.8121 

5  8.2655 

6 

0.7123 

7  0.4637 

IOBBEB  OP  CASES  CLASSIFIED  INTO  GRUOP  - 
Q  EQ  X  EX 

6100  P 

Q  EQ  9  0 

X  EX  1  20 


STEP  IOBBEB  2 

VARIABLE  EITEBED  3 

TABIABLES  INCLUDED  AND  F  TO  BEBOTE  -  DEGREES  OP  FBEEDOB  1  27 

3  12.8523  8  55.0613 

VABIABLZS  IOT  INCLUDED  1 ID  F  TO  ENTER  -  DEGREES  OF  FREEDOB  1  26 

1  0.0500  2  0.6439  4  0.0133  5  0.0682 


6  0.2589  7  0.3431 

IOBBEB  OF  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  EX 

8I00P 

Q  EQ  9  0 

X  EX  0  21 
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TABLE  IV-10  (cont'd.) 


STEP  IOBBEB  3 

VARIABLE  EBTEBEO  2 

variables  included  and  r  to  behove  -  degrees  op  preedob  i 

2  0.6439  3  8.0602  8  54.1304 

VARIABLES  HOT  IHCLODED  AND  P  TO  ENTER  -  DEGREES  CP  PREEDOB 
1  0.3497  4  0.8225  5  1.2859  6 

7  0.9414 

NOHBEB  OP  CASES  CLASSIFIED  ZNTO  GROUP  - 
g  EQ  X  EX 

GROOP 

0  EQ  9  0 

X  EX  0  21 


STEP  IOBBEB  4 

VARIABLE  ENTERED  5 

VARIABLES  INCLUDED  AND  P  TO  REHOVE  -  DEGREES  OP  PREEDOB 
2  1.8664  3  4.7101  5  1.2859 

VARIABLES  NOT  INCLUDED  AN D  P  TO  ENTER  -  DEGREES  OP  PREEDOB 

1  0.1152  4  0.0005  6  0. 1 180 


RUBBER  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  EX 

GROUP 

Q  EQ  9  0 

X  EX  1  20 


STEP  NUBBER  5 

VARIABLE  ENTERED  7 

VARIABLES  INCLUDED  AND  F  TO  BEHOVE  -  DEGREES  OP  PREEDOB  1 

2  2.0666  3  5.1251  5  0.8372  7 


8  46.8682 


VARIABLES  NOT  INCLUDED  AND  P  TO  SNTB8  -  DEGREES  CP  PREEDOB 

1  0.051S  4  0.1484  6  0.0444 

NUBBER  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  EX 

GROUP 

Q  EQ  9  0 

X  IE  1  20 


26 

1  25 

0.4960 


25 

49.9242 
1  24 

0.5118 


24 

0.5118 

1  23 


r 

i 
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TABLE  IV-10  (cont'd.) 


ST  BP  *011  BEE  6 

VARIABLE  ENTERED  4 

VARIABLES  INCLUDED  AND  F  TO  REMOVE  -  DEGREES  OP  PR EE DOR 


1  23 


2.  1392 
0.6415 


4.9303 


8  29.5269 


0.1484 


0.0233 


V1BZABLES  NOT  INCLUDED  AND  P  TO  ENTER  -  DEGREES  CP  FREEDOM 


0. 1121 


0.0013 


GBOOP 
0  BO 
X  EZ 


BOMBER  OP  CASES  CLASSIFIED  INTO  GROUP  - 
Q  EQ  X  EX 


STEP  NUMBER  7 

VARIABLE  ENTERED  1 

VARIABLES  INCLUDED  AND  P  TO  REMOVE  -  DEGREES  OP  FREEDOM 


0.1121 


0.0668 


1.3623 


0.6063 


4.7352 


8  28.3561 


0.2050 


VARIABLES  NOT  INCLUDED  AND  P  TO  ENTER  -  DEGREES  OP  FREEDOM 


0.0018 


CROUP 
0  EQ 
X  EX 


BOMBER  OP  CASES  CLASSIFIED  INTO  GBOOP 

Q  EQ  X  BX 
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